ChatGPT is a chatbot and virtual assistant developed by OpenAI and launched on November 30, 2022. Based on large language models (LLMs), it enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language.
...
Successive user prompts and replies are considered at each conversation stage as context.
Initial release November 30, 2022 (2 years ago) (2022-11-30) Type Chatbot, Large language model, Generative pre-trained transformer
Methods
We trained this model using Reinforcement Learning from Human Feedback (RLHF), using the same methods as InstructGPT, but with slight differences in the data collection setup. We trained an initial model using supervised fine-tuning: human AI trainers provided conversations in which they played both
...
sides—the user and an AI assistant. We gave the trainers access to model-written suggestions to help them compose their responses. We mixed this new dialogue dataset with the InstructGPT dataset, which we transformed into a dialogue format.
model
learning
feedback
data
collection
access
help
dialogue
Iterative deployment
Today’s research release of ChatGPT is the latest step in OpenAI’s iterative deployment of increasingly safe and useful AI systems. Many lessons from deployment of earlier models like GPT-3 and Codex have informed the safety mitigations in place for this release, including substantial reductions in harmful
...
and untruthful outputs achieved by the use of reinforcement learning from human feedback (RLHF).
research
safety
place
learning
feedback
User reviews
add review
Reviews represent subjective personal experiences. Individual experiences may vary.
Always take them with a grain of salt.