#research

216 posts

4 Jun

4 Jun 2026 1 min read

Dreaming: Better memory for a more helpful ChatGPT

ChatGPT introduces a new memory system to better remember preferences, keeping context fresh and relevant across conversations.

research

3 Jun

Isha Salian 3 Jun 2026 5 min read

NVIDIA Research Unlocks Advanced Grasping, Smarter Autonomous Driving and Agent Training at Scale

Nvidia

What makes a robot gripper useful isn’t that it can pick up one object — it’s that it can pick up the next one, and the one after that, with a tool it’s never held before. What makes an autonomous vehicle system safe isn’t just that it can reason through a situation — it’s that […]

driving research robotics isaacnvidia research

28 May

Katie Washabaugh 28 May 2026 7 min read

NVIDIA Research Advances Robotics From Simulation to the Real World

Nvidia

Robotics is entering a new phase: moving from controlled demos and scripted automation toward generalizable, reliable embodied autonomy in the real world. At the International Conference on Robotics and Automation (ICRA), eight of NVIDIA Research’s 28 accepted papers show how simulation-to-real transfer is becoming a foundation for that shift, helping robots perceive, reason, plan and […]

research robotics isaacnvidia researchomniverse

Criteo Tech 28 May 2026 13 min read

What Stood Out at ICLR 2026: Criteo Papers and Research Highlights

Criteo

Authors: Ahmed Ben Yahmed , Antoine Schnepf , Karim Kassab , and Mélissa Tamine . The 14th International Conference on Learning Representations ( ICLR 2026 ) was held from April 23 to 27, 2026, at the Riocentro Convention and Event Center in Rio de Janeiro, Brazil. It was the first time the conference made its way to South America. As…

iclrllm research agentic-ai ai

20 May

20 May 2026 1 min read

An OpenAI model has disproved a central conjecture in discrete geometry

OpenAI Engineering

An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in AI-driven mathematics.

research

12 May

12 May 2026 1 min read

What Parameter Golf taught us about AI-assisted research

OpenAI Engineering

Parameter Golf brought together 1,000+ participants and 2,000+ submissions to explore AI-assisted machine learning research, coding agents, quantization, and novel model design under strict constraints.

research

7 May

Brian Caulfield 7 May 2026 4 min read

Powering the Next American Century: US Energy Secretary Chris Wright and NVIDIA’s Ian Buck on the Genesis Mission

Nvidia

AI will help build the energy it needs. That’s the case U.S. Energy Secretary Chris Wright and NVIDIA Vice President of Hyperscale and High-Performance Computing Ian Buck made Thursday morning at the SCSP AI+ Expo. The 30-minute fireside chat, moderated by SCSP president Ylli Bajraktari, was called “Powering the Next American Century.” Their argument: American […]

ai infrastructure corporate hardware research supercomputing

30 Apr

Criteo Tech 30 Apr 2026 8 min read

The Grind Behind the Epiphany: A Short Story of a Research Project

Criteo

Author: Alain Rakotomamonjy From ideation to outcome, this is the story of a privacy-preserving research project. It tells how research can generate innovations but also joy and despair. Early 2024: The “Hammer” Phase Two research leads who pioneered Criteo’s early privacy initiatives, as part of the Criteo multi-year research program, introduced me to a challenge born from the Privacy Sandbox…

research-paperpaperresearch algorithmsscience

23 Apr

Brian Caulfield 23 Apr 2026 1 min read

Making Sense of the Early Universe

Nvidia

This Spring Astronomy Day, here’s a look at how AI and GPUs are helping astronomers work through unprecedented volumes of cosmic data.

ai research ai for good artificial intelligence computer vision

22 Apr

22 Apr 2026 1 min read

Introducing OpenAI Privacy Filter

OpenAI Engineering

OpenAI Privacy Filter is an open-weight model for detecting and redacting personally identifiable information (PII) in text with state-of-the-art accuracy

research

16 Apr

16 Apr 2026 1 min read

Introducing GPT-Rosalind for life sciences research

OpenAI Engineering

OpenAI introduces GPT-Rosalind, a frontier reasoning model built to accelerate drug discovery, genomics analysis, protein reasoning, and scientific research workflows.

research

7 Apr

João Bernardo Narciso 7 Apr 2026 8 min read

Uncovering the Shape of Fraud with Cosmos Explorer: Visual Metaphors Behind Millions of…

Feedzai

Uncovering the Shape of Fraud with Cosmos Explorer: Visual Metaphors Behind Millions of Transactions The Data Visualization Research team is developing Cosmos Explorer, an interface that leverages universe-related visual metaphors to convey information about the billions of transactions processed by Feedzai. Pedro Cruz, professor at Northeastern University, partnered with Feedzai to bring this idea to life by contributing with his…

datavizfrauddesign researchfraud-detection

25 Mar

25 Mar 2026 1 min read

Inside our approach to the Model Spec

OpenAI Engineering

Learn how OpenAI’s Model Spec serves as a public framework for model behavior, balancing safety, user freedom, and accountability as AI systems advance.

research

12 Mar

Yiwen Xu 12 Mar 2026 3 min read

Flexibility Over Lock-In: The Enterprise Shift in Agent Strategy

Docker

Building agents is now a strategic priority for 95% of respondents in our latest State of Agentic AI research, which surveyed more than 800 developers and decision-makers worldwide. The shift is happening quickly: agent adoption has moved beyond experiments and demos into early operational maturity. But the road to enterprise-scale adoption is still complex. The...

enterprise research ai ml docker

10 Mar

Yiwen Xu 10 Mar 2026 3 min read

What’s Holding Back AI Agents? It’s Still Security

Docker

It’s hard to find a team today that isn’t talking about agents. For most organizations, this isn’t a “someday” project anymore. Building agents is a strategic priority for 95% of respondents that we surveyed across the globe with 800+ developers and decision makers in our latest State of Agentic AI research. The shift is happening...

enterprise research ai ml docker

10 Mar 2026 1 min read

Improving instruction hierarchy in frontier LLMs

OpenAI Engineering

IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.

research

5 Mar

5 Mar 2026 1 min read

Reasoning models struggle to control their chains of thought, and that’s good

OpenAI Engineering

OpenAI introduces CoT-Control and finds reasoning models struggle to control their chains of thought, reinforcing monitorability as an AI safety safeguard.

research

4 Mar

4 Mar 2026 1 min read

Extending single-minus amplitudes to gravitons

OpenAI Engineering

A new preprint extends single-minus amplitudes to gravitons, with GPT-5.2 Pro helping derive and verify nonzero graviton tree amplitudes in quantum gravity.

research

23 Feb

23 Feb 2026 1 min read

Why we no longer evaluate SWE-bench Verified

OpenAI Engineering

SWE-bench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWE-bench Pro.

research

20 Feb

Yiwen Xu 20 Feb 2026 2 min read

State of Agentic AI Report: Key Findings

Docker

Based on Docker’s State of Agentic AI report, a global survey of more than 800 developers, platform engineers, and technology decision-makers, this blog summarizes key findings of what's really happening as agentic AI scales within organizations. Drawing on insights from decision-makers and purchase influencers worldwide, we'll give you a preview on not only where teams...

research ai ml developers enterprise

20 Feb 2026 1 min read

Our First Proof submissions

OpenAI Engineering

We share our AI model’s proof attempts for the First Proof math challenge, testing research-grade reasoning on expert-level problems.

research

18 Feb

18 Feb 2026 1 min read

Introducing EVMbench

OpenAI Engineering

OpenAI and Paradigm introduce EVMbench, a benchmark evaluating AI agents’ ability to detect, patch, and exploit high-severity smart contract vulnerabilities.

research

13 Feb

13 Feb 2026 1 min read

GPT-5.2 derives a new result in theoretical physics

OpenAI Engineering

A new preprint shows GPT-5.2 proposing a new formula for a gluon amplitude, later formally proved and verified by OpenAI and academic collaborators.

research

5 Feb

5 Feb 2026 1 min read

GPT-5 lowers the cost of cell-free protein synthesis

OpenAI Engineering

An autonomous lab combining OpenAI’s GPT-5 with Ginkgo Bioworks’ cloud automation cut cell-free protein synthesis costs by 40% through closed-loop experimentation.

research

8 Jan

Zoe Kessler 8 Jan 2026 3 min read

Japan Science and Technology Agency Develops NVIDIA-Powered Moonshot Robot for Elderly Care

Nvidia

The next universal technology since the smartphone is on the horizon — and it may be a little less pocket friendly. The Moonshot research program, funded by the Japan Science and Technology Agency and accelerated by NVIDIA AI and robotics technologies, is working to create a world by 2050 where AI-powered, autonomously learning robots are […]

research robotics ai for good gpu healthcare and life sciences

22 Dec 2025

Zoe Kessler 22 Dec 2025 4 min read

Marine Biological Laboratory Explores Human Memory With AI and Virtual Reality

Nvidia

The works of Plato state that when humans have an experience, some level of change occurs in their brain, which is powered by memory — specifically long-term memory. This change is what Andre Fenton, professor of neural science at New York University, and Abhishek Kumar, assistant professor of cell and regenerative biology at the University […]

ai researchworkstationeducation nvidia rtx

18 Dec 2025

18 Dec 2025 1 min read

Evaluating chain-of-thought monitorability

OpenAI Engineering

OpenAI introduces a new framework and evaluation suite for chain-of-thought monitorability, covering 13 evaluations across 24 environments. Our findings show that monitoring a model’s internal reasoning is far more effective than monitoring outputs alone, offering a promising path toward scalable control as AI systems grow more capable.

research

17 Dec 2025

Zoe Kessler 17 Dec 2025 4 min read

UC San Diego Lab Advances Generative AI Research With NVIDIA DGX B200 System

Nvidia

The Hao AI Lab research team at the University of California San Diego — at the forefront of pioneering AI model innovation — recently received an NVIDIA DGX B200 system to elevate their critical work in large language model inference. Many LLM inference platforms in production today, such as NVIDIA Dynamo, use research concepts that […]

ai ai infrastructure research supercomputing artificial intelligence

16 Dec 2025

16 Dec 2025 1 min read

Evaluating AI’s ability to perform scientific research tasks

OpenAI Engineering

OpenAI introduces FrontierScience, a benchmark testing AI reasoning in physics, chemistry, and biology to measure progress toward real scientific research.

research

16 Dec 2025 1 min read

Measuring AI’s capability to accelerate biological research

OpenAI Engineering

OpenAI introduces a real-world evaluation framework to measure how AI can accelerate biological research in the wet lab. Using GPT-5 to optimize a molecular cloning protocol, the work explores both the promise and risks of AI-assisted experimentation.

research

4 Dec 2025

Sylvia Chanak 4 Dec 2025 2 min read

NVIDIA Awards up to $60,000 Research Fellowships to PhD Students

Nvidia

For 25 years, the NVIDIA Graduate Fellowship Program has supported graduate students doing outstanding work relevant to NVIDIA technologies. Today, the program announced the latest awards of up to $60,000 each to 10 Ph.D. students involved in research that spans all areas of computing innovation. Selected from a highly competitive applicant pool, the awardees will […]

ai research artificial intelligence education

3 Dec 2025

3 Dec 2025 1 min read

How confessions can keep language models honest

OpenAI Engineering

OpenAI researchers are testing “confessions,” a method that trains models to admit when they make mistakes or act undesirably, helping improve AI honesty, transparency, and trust in model outputs.

research

1 Dec 2025

Bryan Catanzaro 1 Dec 2025 6 min read

At NeurIPS, NVIDIA Advances Open Model Development for Digital and Physical AI

Nvidia

Researchers worldwide rely on open-source technologies as the foundation of their work. To equip the community with the latest advancements in digital and physical AI, NVIDIA is further expanding its collection of open AI models, datasets and tools — with potential applications in virtually every research field. At NeurIPS, one of the world’s top AI […]

ai corporate driving research robotics

20 Nov 2025

Dion Harris 20 Nov 2025

Gordon Bell Prize Winners Push Open Science Boundaries With NVIDIA-Powered Supercomputers

Nvidia

Five finalists for the esteemed high-performance computing award have achieved breakthroughs in climate modeling, fluid simulation and more with the Alps, JUPITER and Perlmutter supercomputers — with two winners taking home the prize.

research supercomputing aerospace ai for good artificial intelligence

Zoe Kessler 20 Nov 2025 4 min read

The Largest Digital Zoo: Biology Model Trained on NVIDIA GPUs Identifies Over a Million Species

Nvidia

Tanya Berger-Wolf’s first computational biology project started as a bet with a colleague: that she could build an AI model capable of identifying individual zebras faster than a zoologist. She won. Now, the director of the Translational Data Analytics Institute and a professor at The Ohio State University, Berger-Wolf is taking on the whole animal […]

ai research artificial intelligence educationscience

20 Nov 2025 1 min read

Early experiments in accelerating science with GPT-5

OpenAI Engineering

OpenAI introduces the first research cases showing how GPT-5 accelerates scientific progress across math, physics, biology, and computer science. Explore how AI and researchers collaborate to generate proofs, uncover new insights, and reshape the pace of discovery.

research

19 Nov 2025

19 Nov 2025 1 min read

How evals drive the next chapter in AI for businesses

OpenAI Engineering

Learn how evals help businesses define, measure, and improve AI performance—reducing risk, boosting productivity, and driving strategic advantage.

research

18 Nov 2025

Dion Harris 18 Nov 2025 3 min read

The Great Flip: How Accelerated Computing Redefined Scientific Systems — and What Comes Next

Nvidia

Where CPUs once ruled, power efficiency — and then AI — flipped the balance. Extreme co-design across GPUs, networking and software now drives the frontier of science.

ai infrastructure networking research supercomputing cuda-x

17 Nov 2025

Kibibi Moseley 17 Nov 2025 5 min read

NVIDIA Accelerated Computing Enables Scientific Breakthroughs for Materials Discovery

Nvidia

To power future technologies including liquid-cooled data centers, high-resolution digital displays and long-lasting batteries, scientists are searching for novel chemicals and materials optimized for factors like energy use, durability and efficacy. New NVIDIA-accelerated data processing pipelines and AI microservices unveiled at the SC25 conference in St. Louis are advancing chemistry and material science to support […]

ai infrastructure research supercomputing artificial intelligence cuda-x

13 Nov 2025

13 Nov 2025 1 min read

Understanding neural networks through sparse circuits

OpenAI Engineering

OpenAI is exploring mechanistic interpretability to understand how neural networks reason. Our new sparse model approach could make AI systems more transparent and support safer, more reliable behavior.

research

3 Nov 2025

3 Nov 2025 1 min read

Introducing IndQA

OpenAI Engineering

OpenAI introduces IndQA, a new benchmark for evaluating AI systems in Indian languages. Built with domain experts, IndQA tests cultural understanding and reasoning across 12 languages and 10 knowledge areas.

research

9 Oct 2025

9 Oct 2025 1 min read

Defining and evaluating political bias in LLMs

OpenAI Engineering

Learn how OpenAI evaluates political bias in ChatGPT through new real-world testing methods that improve objectivity and reduce bias.

research

3 Oct 2025

Jacopo Bono 3 Oct 2025 12 min read

CAUSAL CONCEPT-BASED EXPLANATIONS

Feedzai

Introduction Over the years, we have evolved from using simple, often rule-based algorithms to sophisticated machine learning models. These models are incredibly good at finding patterns in large datasets, but due to their complexity it is frequently challenging for a human to understand why a certain input leads to its respective output. This is especially problematic in areas where high-stakes…

researchconcept-learningcausalitydeep-learningexplainability

30 Sept 2025

30 Sept 2025 1 min read

Sora 2 is here

OpenAI Engineering

Our latest video generation model is more physically accurate, realistic, and controllable than prior systems. It also features synchronized dialogue and sound effects. Create with it in the new Sora app.

research

15 Sept 2025

15 Sept 2025 1 min read

How people are using ChatGPT

OpenAI Engineering

New research from the largest study of ChatGPT use shows how the tool creates economic value through both personal and professional use. Adoption is broadening beyond early users, closing gaps and making AI a part of everyday life.

research

5 Sept 2025

5 Sept 2025 1 min read

Why language models hallucinate

OpenAI Engineering

OpenAI’s new research explains why language models hallucinate. The findings show how improved evaluations can enhance AI reliability, honesty, and safety.

research

25 Jul 2025

Sofia Guerreiro 25 Jul 2025 8 min read

Feedzai TrustScore: Enabling Network Intelligence to Fight Financial Crime

Feedzai

By Sofia Guerreiro, Ricardo Ribeiro Pereira, Iker Perez, Jacopo Bono Detecting financial fraud is like finding a moving needle in a shifting haystack . Fraud accounts for a tiny fraction of financial transactions, often less than 0.1%. At the same time, fraudsters are constantly adapting their tactics to evade detection. And this happens within a live and dynamic environment, where…

machine-learningfraud-detectionresearchnetwork-intelligencefeedzai

21 Mar 2025

21 Mar 2025 1 min read

Early methods for studying affective use and emotional well-being on ChatGPT

OpenAI Engineering

An OpenAI and MIT Media Lab Research collaboration.

research

2 Feb 2025

2 Feb 2025 1 min read

Introducing deep research

OpenAI Engineering

An agent that uses reasoning to synthesize large amounts of online information and complete multi-step research tasks for you. Available to Pro users today, Plus and Team next.

research

31 Jan 2025

31 Jan 2025 1 min read

OpenAI o3-mini System Card

OpenAI Engineering

This report outlines the safety work carried out for the OpenAI o3-mini model, including safety evaluations, external red teaming, and Preparedness Framework evaluations.

research

31 Jan 2025

OpenAI o3-mini

OpenAI Engineering

research

23 Jan 2025

Computer-Using Agent

OpenAI Engineering

research

22 Jan 2025

22 Jan 2025 1 min read

Trading inference-time compute for adversarial robustness

OpenAI Engineering

Trading Inference-Time Compute for Adversarial Robustness

research

5 Dec 2024

5 Dec 2024 1 min read

OpenAI o1 System Card

OpenAI Engineering

This report outlines the safety work carried out prior to releasing OpenAI o1 and o1-mini, including external red teaming and frontier risk evaluations according to our Preparedness Framework.

research

22 Nov 2024

Beatriz Feliciano 22 Nov 2024 4 min read

“Show Me What’s Wrong!”: Enhancing Fraud Detection Analysis by Combining Charts and Text

Feedzai

Every year, millions of people fall victim to financial fraud. In 2023, the losses tied to this type of crime were estimated at US$159 billion just in the US , with some people losing all of their retirement savings to scammers . However, the impacts of this issue stretch beyond someone’s finances. It can also impact a victim’s life in…

fraud-investigationresearchfinancial-frauddata-visualization data-analysis

21 Nov 2024

21 Nov 2024 1 min read

Advancing red teaming with people and AI

OpenAI Engineering

Advancing red teaming with people and AI

research

30 Oct 2024

30 Oct 2024 1 min read

Introducing SimpleQA

OpenAI Engineering

A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.

research

23 Oct 2024

23 Oct 2024 1 min read

Simplifying, stabilizing, and scaling continuous-time consistency models

OpenAI Engineering

We’ve simplified, stabilized, and scaled continuous-time consistency models, achieving comparable sample quality to leading diffusion models, while using only two sampling steps.

research

15 Oct 2024

15 Oct 2024 1 min read

Evaluating fairness in ChatGPT

OpenAI Engineering

We've analyzed how ChatGPT responds to users based on their name, using AI research assistants to protect privacy.

research

10 Oct 2024

10 Oct 2024 1 min read

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

OpenAI Engineering

We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.

research

4 Oct 2024

Ricardo Ribeiro Pereira 4 Oct 2024 9 min read

The GANfather: Using Malicious GenAI Agents to Combat Money Laundering

Feedzai

Digital systems have become deeply integrated into many aspects of modern life, particularly within the financial sector. While digital banking simplifies day-to-day operations for clients, it also creates new opportunities for malicious actors to exploit these systems. As a result, money laundering has grown particularly prevalent due to this digital expansion. Banks are required to monitor for money laundering activities…

gansmoney-launderingfeedzai genai research

12 Sept 2024

Learning to reason with LLMs

OpenAI Engineering

research

12 Sept 2024 1 min read

OpenAI o1-mini

OpenAI Engineering

Advancing cost-efficient reasoning

research

13 Aug 2024

13 Aug 2024 1 min read

Introducing SWE-bench Verified

OpenAI Engineering

We’re releasing a human-validated subset of SWE-bench that more reliably evaluates AI models’ ability to solve real-world software issues.

research

12 Aug 2024

Sérgio Jesus 12 Aug 2024 9 min read

Aequitas Flow step-by-step: a Fair ML optimization framework

Feedzai

By Sérgio Jesus, Inês Silva, Pedro Saleiro, Hugo Ferreira, Pedro Bizarro In this blog post we will visit Aequitas Flow , an Open-Source framework designed to run complete and standardized experiments of Fair ML algorithms. We encourage you to try Aequitas Flow with the Google Colab Notebooks, which are available in the project’s GitHub repository . This blog post is…

responsible-aifairnessopen-source research machine-learning

24 Jul 2024

24 Jul 2024 1 min read

Improving Model Safety Behavior with Rule-Based Rewards

OpenAI Engineering

We've developed and applied a new method leveraging Rule-Based Rewards (RBRs) that aligns models to behave safely without extensive human data collection.

research

18 Jul 2024

GPT-4o mini: advancing cost-efficient intelligence

OpenAI Engineering

research

17 Jul 2024

17 Jul 2024 1 min read

Prover-Verifier Games improve legibility of language model outputs

OpenAI Engineering

Discover how prover-verifier games improve the legibility of language model outputs, making AI solutions clearer, easier to verify, and more trustworthy for both humans and machines.

research

10 Jul 2024

10 Jul 2024 1 min read

OpenAI and Los Alamos National Laboratory announce research partnership

OpenAI Engineering

OpenAI and Los Alamos National Laboratory are working to develop safety evaluations to assess and measure biological capabilities and risks associated with frontier models.

research

21 Jun 2024

Javier Liébana 21 Jun 2024 13 min read

Building Trust in a Digital World: The Role of Machine Learning in Behavioral Biometrics

Feedzai

In the world of financial services, the bank or financial institution’s relationship with the customer relies on digital trust , which is anchored in two fundamental principles. First, it must ensure the person engaging through digital banking channels is genuinely the individual they claim to be. Second, it must confirm that this person is authorized to complete the intended financial…

feedzaidigital-trustonline-fraud-preventionmachine-learning research

20 Jun 2024

20 Jun 2024 1 min read

A Holistic Approach to Undesired Content Detection in the Real World

OpenAI Engineering

We present a holistic approach to building a robust and useful natural language classification system for real-world content moderation.

research

20 Jun 2024 1 min read

Consistency Models

OpenAI Engineering

Diffusion models have significantly advanced the fields of image, audio, and video generation, but they depend on an iterative sampling process that causes slow generation.

research

20 Jun 2024 1 min read

Improved Techniques for Training Consistency Models

OpenAI Engineering

Consistency models are a nascent family of generative models that can sample high quality data in one step without the need for adversarial training.

research

6 Jun 2024

6 Jun 2024 1 min read

Extracting Concepts from GPT-4

OpenAI Engineering

Using new techniques for scaling sparse autoencoders, we automatically identified 16 million patterns in GPT-4's computations.

research

13 May 2024

13 May 2024 1 min read

Hello GPT-4o

OpenAI Engineering

We’re announcing GPT-4 Omni, our new flagship model which can reason across audio, vision, and text in real time.

research

7 May 2024

7 May 2024 1 min read

Understanding the source of what we see and hear online

OpenAI Engineering

Today we’re introducing new technology to help researchers identify content created by our tools and joining the Coalition for Content Provenance and Authenticity Steering Committee to promote industry standards.

research

19 Apr 2024

19 Apr 2024 1 min read

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

OpenAI Engineering

Today's LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow adversaries to overwrite a model's original instructions with their own malicious prompts.

research

15 Feb 2024

15 Feb 2024 1 min read

Video generation models as world simulators

OpenAI Engineering

We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patches of video and image latent codes. Our largest model, Sora, is capable of generating a minute of high fidelity video. Our results…

research

31 Jan 2024

31 Jan 2024 1 min read

Building an early warning system for LLM-aided biological threat creation

OpenAI Engineering

We’re developing a blueprint for evaluating the risk that a large language model (LLM) could aid someone in creating a biological threat. In an evaluation involving both biology experts and students, we found that GPT-4 provides at most a mild uplift in biological threat creation accuracy. While this uplift is not large enough to be conclusive, our finding is a…

research

31 May 2023

31 May 2023 1 min read

Improving mathematical reasoning with process supervision

OpenAI Engineering

We've trained a model to achieve a new state-of-the-art in mathematical problem solving by rewarding each correct step of reasoning (“process supervision”) instead of simply rewarding the correct final answer (“outcome supervision”). In addition to boosting performance relative to outcome supervision, process supervision also has an important alignment benefit: it directly trains the model to produce a chain-of-thought that is…

research

25 May 2023

25 May 2023 1 min read

Democratic inputs to AI

OpenAI Engineering

Our nonprofit organization, OpenAI, Inc., is launching a program to award ten $100,000 grants to fund experiments in setting up a democratic process for deciding what rules AI systems should follow, within the bounds defined by the law.

research

17 Mar 2023

GPTs are GPTs: An early look at the labor market impact potential of large language models

OpenAI Engineering

research

14 Mar 2023

14 Mar 2023 1 min read

GPT-4

OpenAI Engineering

We’ve created GPT-4, the latest milestone in OpenAI’s effort in scaling up deep learning. GPT-4 is a large multimodal model (accepting image and text inputs, emitting text outputs) that, while less capable than humans in many real-world scenarios, exhibits human-level performance on various professional and academic benchmarks.

research

16 Dec 2022

Point-E: A system for generating 3D point clouds from complex prompts

OpenAI Engineering

research

19 Oct 2022

Scaling laws for reward model overoptimization

OpenAI Engineering

research

21 Sept 2022

Introducing Whisper

OpenAI Engineering

research

28 Jul 2022

Efficient training of language models to fill in the middle

OpenAI Engineering

research

28 Jun 2022

28 Jun 2022 1 min read

DALL·E 2 pre-training mitigations

OpenAI Engineering

In order to share the magic of DALL·E 2 with a broad audience, we needed to reduce the risks associated with powerful image generation models. To this end, we put various guardrails in place to prevent generated images from violating our content policy.

research

23 Jun 2022

23 Jun 2022 1 min read

Learning to play Minecraft with Video PreTraining

OpenAI Engineering

We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small amount of labeled contractor data. With fine-tuning, our model can learn to craft diamond tools, a task that usually takes proficient humans over 20 minutes (24,000 actions). Our model uses the native human…

research

17 Jun 2022

Evolution through large models

OpenAI Engineering

research

9 Jun 2022

9 Jun 2022 1 min read

Techniques for training large neural networks

OpenAI Engineering

Large neural networks are at the core of many recent advances in AI, but training them is a difficult engineering and research challenge which requires orchestrating a cluster of GPUs to perform a single synchronized calculation.

research

28 May 2022

Teaching models to express their uncertainty in words

OpenAI Engineering

research

13 Apr 2022

Hierarchical text-conditional image generation with CLIP latents

OpenAI Engineering

research

3 Mar 2022

A research agenda for assessing the economic impacts of code generation models

OpenAI Engineering

research

2 Feb 2022

2 Feb 2022 1 min read

Solving (some) formal math olympiad problems

OpenAI Engineering

We built a neural theorem prover for Lean that learned to solve a variety of challenging high-school olympiad problems, including problems from the AMC12 and AIME competitions, as well as two problems adapted from the IMO.

research

24 Jan 2022

Text and code embeddings by contrastive pre-training

OpenAI Engineering

research

16 Dec 2021

16 Dec 2021 1 min read

WebGPT: Improving the factual accuracy of language models through web browsing

OpenAI Engineering

We’ve fine-tuned GPT-3 to more accurately answer open-ended questions using a text-based web browser.

research

29 Oct 2021

29 Oct 2021 1 min read

Solving math word problems

OpenAI Engineering

We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many problems as real kids: a small sample of 9-12 year olds scored 60% on a test from our dataset, while our system scored 55% on those same problems.

research

8 Sept 2021

TruthfulQA: Measuring how models mimic human falsehoods

OpenAI Engineering

research

28 Jul 2021

28 Jul 2021 1 min read

Introducing Triton: Open-source GPU programming for neural networks

OpenAI Engineering

We’re releasing Triton 1.0, an open-source Python-like programming language which enables researchers with no CUDA experience to write highly efficient GPU code—most of the time on par with what an expert would be able to produce.

research

7 Jul 2021

Evaluating large language models trained on code

OpenAI Engineering

research

1 Jul 2021

Ken Howard 1 Jul 2021 1 min read

Cloud Application Security – Risks, Questions, Insights, and Solutions

OpenDNS

Cloud-based applications have helped make a new world of work possible. But they have also opened up the doors to new risks and threats, such as ransomware by remote desktop takeover and data loss through unprotected cloud storage use. The post Cloud Application Security – Risks, Questions, Insights, and Solutions appeared first on Cisco Umbrella.

research

4 Mar 2021

4 Mar 2021 1 min read

Multimodal neurons in artificial neural networks

OpenAI Engineering

We’ve discovered neurons in CLIP that respond to the same concept whether presented literally, symbolically, or conceptually. This may explain CLIP’s accuracy in classifying surprising visual renditions of concepts, and is also an important step toward understanding the associations and biases that CLIP and similar models learn.

research

4 Feb 2021

Understanding the capabilities, limitations, and societal impact of large language models

OpenAI Engineering

research

25 Jan 2021

25 Jan 2021 1 min read

Scaling Kubernetes to 7,500 nodes

OpenAI Engineering

We’ve scaled Kubernetes clusters to 7,500 nodes, producing a scalable infrastructure for large models like GPT-3, CLIP, and DALL·E, but also for rapid small-scale iterative research such as Scaling Laws for Neural Language Models.

research

5 Jan 2021

5 Jan 2021 1 min read

DALL·E: Creating images from text

OpenAI Engineering

We’ve trained a neural network called DALL·E that creates images from text captions for a wide range of concepts expressible in natural language.

research

5 Jan 2021 1 min read

CLIP: Connecting text and images

OpenAI Engineering

We’re introducing a neural network called CLIP which efficiently learns visual concepts from natural language supervision. CLIP can be applied to any visual classification benchmark by simply providing the names of the visual categories to be recognized, similar to the “zero-shot” capabilities of GPT-2 and GPT-3.

research

7 Sept 2020

Generative language modeling for automated theorem proving

OpenAI Engineering

research

17 Jun 2020

17 Jun 2020 1 min read

Image GPT

OpenAI Engineering

We find that, just as a large transformer model trained on language can generate coherent text, the same exact model trained on pixel sequences can generate coherent image completions and samples. By establishing a correlation between sample quality and image classification accuracy, we show that our best generative model also contains features competitive with top convolutional nets in the unsupervised…

research

28 May 2020

Language models are few-shot learners

OpenAI Engineering

research

5 May 2020

5 May 2020 1 min read

AI and efficiency

OpenAI Engineering

We’re releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet classification has been decreasing by a factor of 2 every 16 months. Compared to 2012, it now takes 44 times less compute to train a neural network to the level of AlexNet (by contrast, Moore’s Law…

research

30 Apr 2020

30 Apr 2020 1 min read

Jukebox

OpenAI Engineering

We’re introducing Jukebox, a neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles. We’re releasing the model weights and code, along with a tool to explore the generated samples.

research

16 Apr 2020

16 Apr 2020 1 min read

Improving verifiability in AI development

OpenAI Engineering

We’ve contributed to a multi-stakeholder report by 58 co-authors at 30 organizations, including the Centre for the Future of Intelligence, Mila, Schwartz Reisman Institute for Technology and Society, Center for Advanced Study in the Behavioral Sciences, and Center for Security and Emerging Technologies. This report describes 10 mechanisms to improve the verifiability of claims made about AI systems. Developers can…

research

14 Apr 2020

14 Apr 2020 1 min read

OpenAI Microscope

OpenAI Engineering

We’re introducing OpenAI Microscope, a collection of visualizations of every significant layer and neuron of eight vision “model organisms” which are often studied in interpretability. Microscope makes it easier to analyze the features that form inside these neural networks, and we hope it will help the research community as we move towards understanding these complicated systems.

research

23 Jan 2020

Scaling laws for neural language models

OpenAI Engineering

research

19 Dec 2019

Kadir Topal 19 Dec 2019 2 min read

Presenting the MDN Web Developer Needs Assessment (Web DNA) Report

Mozilla Hacks

The first annual MDN Developer Needs Assessment aims to represent the voices of developers and designers working on the web. We've analyzed the data provided by more than 28,000 completed surveys, and we've identified 28 discrete needs, sorted into 14 different themes. Four of the top ten needs relate to browser compatibility, our #1 theme. Documentation, Testing, Debugging, and Frameworks…

featured article mdn news research survey

13 Dec 2019

Dota 2 with large scale deep reinforcement learning

OpenAI Engineering

research

5 Dec 2019

5 Dec 2019 1 min read

Deep double descent

OpenAI Engineering

We show that the double descent phenomenon occurs in CNNs, ResNets, and transformers: performance first improves, then gets worse, and then improves again with increasing model size, data size, or training time. This effect is often avoided through careful regularization. While this behavior appears to be fairly universal, we don’t yet fully understand why it happens, and view further study…

research

3 Dec 2019

3 Dec 2019 1 min read

Procgen Benchmark

OpenAI Engineering

We’re releasing Procgen Benchmark, 16 simple-to-use procedurally-generated environments which provide a direct measure of how quickly a reinforcement learning agent learns generalizable skills.

research

5 Nov 2019

5 Nov 2019 1 min read

GPT-2: 1.5B release

OpenAI Engineering

As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights to facilitate detection of outputs of GPT-2 models. While there have been larger language models released since August, we’ve continued with our original staged release plan in order to provide the community with a test case…

research

15 Oct 2019

15 Oct 2019 1 min read

Solving Rubik’s Cube with a robot hand

OpenAI Engineering

We’ve trained a pair of neural networks to solve the Rubik’s Cube with a human-like robot hand. The neural networks are trained entirely in simulation, using the same reinforcement learning code as OpenAI Five paired with a new technique called Automatic Domain Randomization (ADR). The system can handle situations it never saw during training, such as being prodded by a…

research

17 Sept 2019

17 Sept 2019 1 min read

Emergent tool use from multi-agent interaction

OpenAI Engineering

We’ve observed agents discovering progressively more complex tool use while playing a simple game of hide-and-seek. Through training in our new simulated hide-and-seek environment, agents build a series of six distinct strategies and counterstrategies, some of which we did not know our environment supported. The self-supervised emergent complexity in this simple environment further suggests that multi-agent co-adaptation may one day…

research

20 Aug 2019

20 Aug 2019 1 min read

GPT-2: 6-month follow-up

OpenAI Engineering

We’re releasing the 774 million parameter GPT-2 language model after the release of our small 124M model in February, staged release of our medium 355M model in May, and subsequent research with partners and the AI community into the model’s potential for misuse and societal benefit. We’re also releasing an open-source legal agreement to make it easier for organizations to…

research

23 May 2019

Nathan Egge 23 May 2019 3 min read

Firefox brings you smooth video playback with the world’s fastest AV1 decoder

Mozilla Hacks

With this week's release of Firefox 67, the new high performance royalty-free AV1 video decoder dav1d is now enabled by default on all desktop platforms (Windows, OSX and Linux) for both 32-bit and 64-bit systems. And work is in progress on rav1e, the Rust AV1 encoder. The post Firefox brings you smooth video playback with the world’s fastest AV1 decoder…

av1featured article firefox performance research

25 Apr 2019

25 Apr 2019 1 min read

MuseNet

OpenAI Engineering

We’ve created MuseNet, a deep neural network that can generate 4-minute musical compositions with 10 different instruments, and can combine styles from country to Mozart to the Beatles. MuseNet was not explicitly programmed with our understanding of music, but instead discovered patterns of harmony, rhythm, and style by learning to predict the next token in hundreds of thousands of MIDI…

research

23 Apr 2019

23 Apr 2019 1 min read

Generative modeling with sparse transformers

OpenAI Engineering

We’ve developed the Sparse Transformer, a deep neural network which sets new records at predicting what comes next in a sequence—whether text, images, or sound. It uses an algorithmic improvement of the attention mechanism to extract patterns from sequences 30x longer than possible previously.

research

15 Apr 2019

15 Apr 2019 1 min read

OpenAI Five defeats Dota 2 world champions

OpenAI Engineering

OpenAI Five is the first AI to beat the world champions in an esports game, having won two back-to-back games versus the world champion Dota 2 team, OG, at Finals this weekend. Both OpenAI Five and DeepMind’s AlphaStar had previously beaten good pros privately but lost their live pro matches, making this also the first time an AI has beaten…

research

21 Mar 2019

21 Mar 2019 1 min read

Implicit generation and generalization methods for energy-based models

OpenAI Engineering

We’ve made progress towards stable and scalable training of energy-based models (EBMs) resulting in better sample quality and generalization ability than existing models. Generation in EBMs spends more compute to continually refine its answers and doing so can generate samples competitive with GANs at low temperatures, while also having mode coverage guarantees of likelihood-based models. We hope these findings stimulate…

research

4 Mar 2019

4 Mar 2019 1 min read

Neural MMO: A massively multiagent game environment

OpenAI Engineering

We’re releasing a Neural MMO, a massively multiagent game environment for reinforcement learning agents. Our platform supports a large, variable number of agents within a persistent and open-ended task. The inclusion of many agents and species leads to better exploration, divergent niche formation, and greater overall competence.

research

14 Feb 2019

14 Feb 2019 1 min read

Better language models and their implications

OpenAI Engineering

We’ve trained a large-scale unsupervised language model which generates coherent paragraphs of text, achieves state-of-the-art performance on many language modeling benchmarks, and performs rudimentary reading comprehension, machine translation, question answering, and summarization—all without task-specific training.

research

4 Feb 2019

Computational limitations in robust classification and win-win results

OpenAI Engineering

research

14 Dec 2018

14 Dec 2018 1 min read

How AI training scales

OpenAI Engineering

We’ve discovered that the gradient noise scale, a simple statistical metric, predicts the parallelizability of neural network training on a wide range of tasks. Since complex tasks tend to have noisier gradients, increasingly large batch sizes are likely to become useful in the future, removing one potential limit to further growth of AI systems. More broadly, these results show that…

research

6 Dec 2018

6 Dec 2018 1 min read

Quantifying generalization in reinforcement learning

OpenAI Engineering

We’re releasing CoinRun, a training environment which provides a metric for an agent’s ability to transfer its experience to novel situations and has already helped clarify a longstanding puzzle in reinforcement learning. CoinRun strikes a desirable balance in complexity: the environment is simpler than traditional platformer games like Sonic the Hedgehog but still poses a worthy generalization challenge for state…

research

8 Nov 2018

8 Nov 2018 1 min read

Spinning Up in Deep RL

OpenAI Engineering

We’re releasing Spinning Up in Deep RL, an educational resource designed to let anyone learn to become a skilled practitioner in deep reinforcement learning. Spinning Up consists of crystal-clear examples of RL code, educational exercises, documentation, and tutorials.

research

7 Nov 2018

7 Nov 2018 1 min read

Learning concepts with energy functions

OpenAI Engineering

We’ve developed an energy-based model that can quickly learn to identify and generate instances of concepts, such as near, above, between, closest, and furthest, expressed as sets of 2d points. Our model learns these concepts after only five demonstrations. We also show cross-domain transfer: we use concepts learned in a 2d particle environment to solve tasks on a 3-dimensional physics-based…

research

5 Nov 2018

Plan online, learn offline: Efficient learning and exploration via model-based control

OpenAI Engineering

research

31 Oct 2018

31 Oct 2018 1 min read

Reinforcement learning with prediction-based rewards

OpenAI Engineering

We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environments through curiosity, which for the first time exceeds average human performance on Montezuma’s Revenge.

research

2 Oct 2018

FFJORD: Free-form continuous dynamics for scalable reversible generative models

OpenAI Engineering

research

23 Aug 2018

23 Aug 2018 1 min read

The International 2018: Results

OpenAI Engineering

OpenAI Five lost two games against top Dota 2 players at The International in Vancouver this week, maintaining a good chance of winning for the first 20–35 minutes of both games.

research

13 Aug 2018

Large-scale study of curiosity-driven learning

OpenAI Engineering

research

6 Aug 2018

6 Aug 2018 1 min read

OpenAI Five Benchmark: Results

OpenAI Engineering

Yesterday, OpenAI Five won a best-of-three against a team of 99.95th percentile Dota players: Blitz, Cap, Fogged, Merlini, and MoonMeander—four of whom have played Dota professionally—in front of a live audience and 100,000 concurrent livestream viewers.

research

30 Jul 2018

30 Jul 2018 1 min read

Learning dexterity

OpenAI Engineering

We’ve trained a human-like robot hand to manipulate physical objects with unprecedented dexterity.

research

26 Jul 2018

Variational option discovery algorithms

OpenAI Engineering

research

9 Jul 2018

9 Jul 2018 1 min read

Glow: Better reversible generative models

OpenAI Engineering

We introduce Glow, a reversible generative model which uses invertible 1x1 convolutions. It extends previous work on reversible generative models and simplifies the architecture. Our model can generate realistic high resolution images, supports efficient sampling, and discovers features that can be used to manipulate attributes of data. We’re releasing code for the model and an online visualization tool so people…

research

4 Jul 2018

4 Jul 2018 1 min read

Learning Montezuma’s Revenge from a single demonstration

OpenAI Engineering

We’ve trained an agent to achieve a high score of 74,500 on Montezuma’s Revenge from a single human demonstration, better than any previously published result. Our algorithm is simple: the agent plays a sequence of games starting from carefully chosen states from the demonstration, and learns from them by optimizing the game score using PPO, the same reinforcement learning algorithm…

research

25 Jun 2018

25 Jun 2018 1 min read

OpenAI Five

OpenAI Engineering

Our team of five neural networks, OpenAI Five, has started to defeat amateur human teams at Dota 2.

research

22 Jun 2018

22 Jun 2018 1 min read

Retro Contest: Results

OpenAI Engineering

The first run of our Retro Contest—exploring the development of algorithms that can generalize from previous experience—is now complete.

research

17 Jun 2018

Learning policy representations in multiagent systems

OpenAI Engineering

research

2 Jun 2018

GamePad: A learning environment for theorem proving

OpenAI Engineering

research

25 May 2018

25 May 2018 1 min read

Gym Retro

OpenAI Engineering

We’re releasing the full version of Gym Retro, a platform for reinforcement learning research on games. This brings our publicly-released game count from around 70 Atari games and 30 Sega games to over 1,000 games across a variety of backing emulators. We’re also releasing the tool we use to add new games to the platform.

research

16 May 2018

16 May 2018 1 min read

AI and compute

OpenAI Engineering

We’re releasing an analysis showing that since 2012, the amount of compute used in the largest AI training runs has been increasing exponentially with a 3.4-month doubling time (by comparison, Moore’s Law had a 2-year doubling period)[^footnote-correction]. Since 2012, this metric has grown by more than 300,000x (a 2-year doubling period would yield only a 7x increase). Improvements in compute…

research

18 Apr 2018

18 Apr 2018 1 min read

Evolved Policy Gradients

OpenAI Engineering

We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test time that were outside their training regime, like learning to navigate to an object on a different side of the room…

research

10 Apr 2018

Gotta Learn Fast: A new benchmark for generalization in RL

OpenAI Engineering

research

5 Apr 2018

5 Apr 2018 1 min read

Retro Contest

OpenAI Engineering

We’re launching a transfer learning contest that measures a reinforcement learning algorithm’s ability to generalize from previous experience.

research

20 Mar 2018

Variance reduction for policy gradient with action-dependent factorized baselines

OpenAI Engineering

research

15 Mar 2018

Improving GANs using optimal transport

OpenAI Engineering

research

8 Mar 2018

On first-order meta-learning algorithms

OpenAI Engineering

research

7 Mar 2018

7 Mar 2018 1 min read

Reptile: A scalable meta-learning algorithm

OpenAI Engineering

We’ve developed a simple meta-learning algorithm called Reptile which works by repeatedly sampling a task, performing stochastic gradient descent on it, and updating the initial parameters towards the final parameters learned on that task. Reptile is the application of the Shortest Descent algorithm to the meta-learning setting, and is mathematically similar to first-order MAML (which is a version of the…

research

3 Mar 2018

Some considerations on learning to explore via meta-reinforcement learning

OpenAI Engineering

research

26 Feb 2018

Multi-Goal Reinforcement Learning: Challenging robotics environments and request for research

OpenAI Engineering

research

26 Feb 2018 1 min read

Ingredients for robotics research

OpenAI Engineering

We’re releasing eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay, all developed for our research over the past year. We’ve used these environments to train models which work on physical robots. We’re also releasing a set of requests for robotics research.

research

15 Feb 2018

15 Feb 2018 1 min read

Interpretable machine learning through teaching

OpenAI Engineering

We’ve designed a method that encourages AIs to teach each other with examples that also make sense to humans. Our approach automatically selects the most informative examples to teach a concept—for instance, the best images to describe the concept of dogs—and experimentally we found our approach to be effective at teaching both AIs

research

7 Feb 2018

7 Feb 2018 1 min read

Discovering types for entity disambiguation

OpenAI Engineering

We’ve built a system for automatically figuring out which object is meant by a word by having a neural network decide if the word belongs to each of about 100 automatically-discovered “types” (non-exclusive categories).

research

31 Jan 2018

31 Jan 2018 1 min read

Requests for Research 2.0

OpenAI Engineering

We’re releasing a new batch of seven unsolved problems which have come up in the course of our research at OpenAI.

research

18 Jan 2018

Scaling Kubernetes to 2,500 nodes

OpenAI Engineering

research

6 Dec 2017

6 Dec 2017 1 min read

Block-sparse GPU kernels

OpenAI Engineering

We’re releasing highly-optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. Depending on the chosen sparsity, these kernels can run orders of magnitude faster than cuBLAS or cuSPARSE. We’ve used them to attain state-of-the-art results in text sentiment analysis and generative modeling of text and images.

research

4 Dec 2017

Learning sparse neural networks through L₀ regularization

OpenAI Engineering

research

2 Nov 2017

Interpretable and pedagogical examples

OpenAI Engineering

research

26 Oct 2017

26 Oct 2017 1 min read

Learning a hierarchy

OpenAI Engineering

We’ve developed a hierarchical reinforcement learning algorithm that learns high-level actions useful for solving a range of tasks, allowing fast solving of tasks requiring thousands of timesteps. Our algorithm, when applied to a set of navigation problems, discovers a set of high-level actions for walking and crawling in different directions, which enables the agent to master new navigation tasks quickly.

research

19 Oct 2017

19 Oct 2017 1 min read

Generalizing from simulation

OpenAI Engineering

Our latest robotics techniques allow robot controllers, trained entirely in simulation and deployed on physical robots, to react to unplanned changes in the environment as they solve simple tasks. That is, we’ve used these techniques to build closed-loop systems rather than open-loop ones as before.

research

18 Oct 2017

Asymmetric actor critic for image-based robot learning

OpenAI Engineering

research

18 Oct 2017

Sim-to-real transfer of robotic control with dynamics randomization

OpenAI Engineering

research

17 Oct 2017

Domain randomization and generative models for robotic grasping

OpenAI Engineering

research

11 Oct 2017

11 Oct 2017 1 min read

Meta-learning for wrestling

OpenAI Engineering

We show that for the task of simulated robot wrestling, a meta-learning agent can learn to quickly defeat a stronger non-meta-learning agent, and also show that the meta-learning agent can adapt to physical malfunction.

research

11 Oct 2017 1 min read

Competitive self-play

OpenAI Engineering

We’ve found that self-play allows simulated AIs to discover physical skills like tackling, ducking, faking, kicking, catching, and diving for the ball, without explicitly designing an environment with these skills in mind. Self-play ensures that the environment is always the right difficulty for an AI to improve. Taken alongside our Dota 2 self-play results, we have increasing confidence that self-play…

research

29 Sept 2017

Nonlinear computation in deep linear networks

OpenAI Engineering

research

14 Sept 2017

14 Sept 2017 1 min read

Learning to model other minds

OpenAI Engineering

We’re releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit-for-tat in the iterated prisoner’s dilemma. This algorithm, Learning with Opponent-Learning Awareness (LOLA), is a small step towards agents that model other minds.

research

13 Sept 2017

Learning with opponent-learning awareness

OpenAI Engineering

research

18 Aug 2017

18 Aug 2017 1 min read

OpenAI Baselines: ACKTR & A2C

OpenAI Engineering

We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C) which we’ve found gives equal performance. ACKTR is a more sample-efficient reinforcement learning algorithm than TRPO and A2C, and requires only slightly more computation than A2C per update.

research

16 Aug 2017

16 Aug 2017 1 min read

More on Dota 2

OpenAI Engineering

Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given sufficient compute. In the span of a month, our system went from barely matching a high-ranked player to beating the top pros and has continued to improve since then. Supervised deep learning systems can only be as…

research

11 Aug 2017

11 Aug 2017 1 min read

Dota 2

OpenAI Engineering

We’ve created a bot which beats the world’s top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from scratch by self-play, and does not use imitation learning or tree search. This is a step towards building AI systems which accomplish well-defined goals in messy, complicated situations involving real humans.

research

3 Aug 2017

3 Aug 2017 1 min read

Gathering human feedback

OpenAI Engineering

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

research

27 Jul 2017

27 Jul 2017 1 min read

Better exploration with parameter noise

OpenAI Engineering

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

research

20 Jul 2017

20 Jul 2017 1 min read

Proximal Policy Optimization

OpenAI Engineering

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinforcement learning algorithm at OpenAI because of its ease of use and good performance.

research

17 Jul 2017

17 Jul 2017 1 min read

Robust adversarial inputs

OpenAI Engineering

We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars would be hard to trick maliciously since they capture images from multiple scales, angles, perspectives, and the like.

research

5 Jul 2017

Hindsight Experience Replay

OpenAI Engineering

research

1 Jul 2017

Teacher–student curriculum learning

OpenAI Engineering

research

28 Jun 2017

28 Jun 2017 1 min read

Faster physics in Python

OpenAI Engineering

We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.

research

8 Jun 2017

8 Jun 2017 1 min read

Learning to cooperate, compete, and communicate

OpenAI Engineering

Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties: first, there is a natural curriculum—the difficulty of the environment is determined by the skill of your competitors (and if you’re competing against clones of yourself, the environment exactly matches your skill level). Second, a multiagent environment has no…

research

5 Jun 2017

UCB exploration via Q-ensembles

OpenAI Engineering

research

24 May 2017

24 May 2017 1 min read

OpenAI Baselines: DQN

OpenAI Engineering

We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published results. We’ll release the algorithms over upcoming months; today’s release includes DQN and three of its variants.

research

16 May 2017

16 May 2017 1 min read

Robots that learn

OpenAI Engineering

We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.

research

15 May 2017

15 May 2017 1 min read

Roboschool

OpenAI Engineering

We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.

research

21 Apr 2017

Equivalence between policy gradients and soft Q-learning

OpenAI Engineering

research

10 Apr 2017

Stochastic Neural Networks for hierarchical reinforcement learning

OpenAI Engineering

research

6 Apr 2017

6 Apr 2017 1 min read

Unsupervised sentiment neuron

OpenAI Engineering

We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.

research

1 Apr 2017

1 Apr 2017 1 min read

Spam detection in the physical world

OpenAI Engineering

We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.

research

24 Mar 2017

24 Mar 2017 1 min read

Evolution strategies as a scalable alternative to reinforcement learning

OpenAI Engineering

We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL’s inconveniences.

research

21 Mar 2017

One-shot imitation learning

OpenAI Engineering

research

16 Mar 2017

16 Mar 2017 1 min read

Learning to communicate

OpenAI Engineering

In this post we’ll outline new OpenAI research in which agents develop their own language.

research

15 Mar 2017

Emergence of grounded compositional language in multi-agent populations

OpenAI Engineering

research

12 Mar 2017

Prediction and control with temporal segment models

OpenAI Engineering

research

6 Mar 2017

Third-person imitation learning

OpenAI Engineering

research

19 Jan 2017

PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications

OpenAI Engineering

research

5 Dec 2016

5 Dec 2016 1 min read

Universe

OpenAI Engineering

We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.

research

15 Nov 2016

#Exploration: A study of count-based exploration for deep reinforcement learning

OpenAI Engineering

research

14 Nov 2016

On the quantitative analysis of decoder-based generative models

OpenAI Engineering

research

11 Nov 2016

A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models

OpenAI Engineering

research

9 Nov 2016

RL²: Fast reinforcement learning via slow reinforcement learning

OpenAI Engineering

research

8 Nov 2016

Variational lossy autoencoder

OpenAI Engineering

research

2 Nov 2016

Extensions and limitations of the neural GPU

OpenAI Engineering

research

11 Oct 2016

Transfer from simulation to real world through learning deep inverse dynamics model

OpenAI Engineering

research

29 Aug 2016

29 Aug 2016 1 min read

Infrastructure for deep learning

OpenAI Engineering

Deep learning is an empirical science, and the quality of a group’s infrastructure is a multiplier on progress. Fortunately, today’s open-source ecosystem makes it possible for anyone to build great deep learning infrastructure.

research

16 Jun 2016

16 Jun 2016 1 min read

Generative models

OpenAI Engineering

This post describes four projects that share a common theme of enhancing or using generative models, a branch of unsupervised learning techniques in machine learning. In addition to describing our work, this post will tell you a bit more about generative models: what they are, why they are important, and where they might be going.

research

27 Apr 2016

27 Apr 2016 1 min read

OpenAI Gym Beta

OpenAI Engineering

We’re releasing the public beta of OpenAI Gym, a toolkit for developing and comparing reinforcement learning (RL) algorithms. It consists of a growing suite of environments (from simulated robots to Atari games), and a site for comparing and reproducing results.

research

25 Feb 2016

Weight normalization: A simple reparameterization to accelerate training of deep neural networks

OpenAI Engineering

research