OpenAI
1,908 posts
OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: openai.com/jobs
Joined December 2015
- GPT-5.4 helped drive a medicinal chemistry project from literature review to a validated experimental result. Paired with Molecule.one’s Maria AI and specialized lab, the model proposed an unexpected way to improve a widely used reaction in drug discovery.
- Replying to @OpenAI: Maria tested the idea across 10,080 reactions, and human chemists later validated representative results by hand. Under the optimized conditions, yields improved for 88% of the boronic acids and 83% of the sulfonamides tested. Human chemists then repeated 14 representative
- We’re sharing new research on a method for anticipating how models may behave in real-world use before release: simulating deployment with recent, de-identified user requests and studying candidate model responses.
- Replying to @OpenAI: Simulated deployments also reduced evaluation awareness to levels close to real production traffic. We extended the method to agentic deployments with stateful tools, showing that tool simulators can produce realistic trajectories when given sufficient context and capabilities.
- Deployment Simulation works best with representative production data, which external evaluators often can’t access. In a companion post for our Alignment blog, we also explore the public WildChat dataset and find that, while less precise, it still provides a useful signal about
- Let’s talk about evals. We’re always looking for better ways to measure and forecast model progress, especially as benchmarks get saturated or gamed. @tejalpatwardhan, who leads our frontier evals team, spoke to @AndrewMayne about why evals matter and what models need to be
- Listen to the OpenAI Podcast on: Spotify (open.spotify.com/episode/5pkjhU…), Apple (podcasts.apple.com/us/podcast/why…), YouTube (youtu.be/CFqjjKp9Y-Q)
- OpenAI reposted: More of Codex is rolling out across Europe this week. We’re bringing Computer use, the Codex Chrome extension, personalized memory, and Chronicle to Codex users in the EEA, UK, and Switzerland.
- OpenAI reposted: The north stars we're working towards at OpenAI all center around the mission: ensure AGI benefits all of humanity. AI should expand human agency, not make people less consequential to the future.
- An issue caused some user accounts to be incorrectly suspended. We’re restoring access and working through related subscription and credit issues.
- What happened when one of our models found a counterexample to an 80-year-old Erdős conjecture? Researchers @alexwei_, @HongxunWu, and @wjmzbmr1 shared the story on the OpenAI Podcast with @AndrewMayne and explained how mathematicians and models can work together to make new
- Listen to the OpenAI Podcast on: Spotify (open.spotify.com/episode/3ca5s3…), Apple (podcasts.apple.com/us/podcast/how…), YouTube (youtu.be/wNWz5Hbh5VQ)