Location
California, United States
Salary
Not specified
Type
Full-time
Posted
Today
Job Description
Connect with me and feel free to apply if you’re a Researcher or Machine Learning Engineer, even if you aren’t sure you’re the perfect fit.
I’m currently working with two elite, well-funded teams in the Bay Area (SF & Palo Alto) that are taking Reinforcement Learning in polar opposite, but equally ambitious, directions:
1. The Foundational Research Lab (San Francisco)
Founded by "old-school" RL PhDs, this lab believes the field is scaling prematurely while ignoring core issues like data inefficiency and long action horizons. Backed by Vercel and South Park Commons, they are building a research-driven environment to scale RL-LLM hybrids by orders of magnitude.
- The Mission: Moving past DPO/RLHF to create agents that genuinely generalize.
- The Team: Talent from DeepMind, Meta, and NVIDIA.
2. The "Ground Truth" Reasoning Startup (Palo Alto)
A Stanford spinout solving for long-horizon reasoning by moving RL into a space where physics and logic provide a non-negotiable ground truth: Chip Design. They are building a "Cursor for Verilog" where agents must plan, critique, and verify their own code against real execution feedback.
- The Mission: Collapsing the 3-year hardware design cycle through automated reasoning.
- The Team: Led by the former Head of AI (Trust & Safety) at Anthropic, with peers from xAI and OpenAI.
- The Backing: Top-tier VCs with support from figures like Jeff Dean.
Both teams are looking for "founding-level" engineers who can ship production-grade systems, not just run experiments.
If either of these philosophies, foundational scaling or physical verification, aligns with where you want to take your next career move, feel free to apply.