

OpenAI collects data from ChatGPT users to further train and fine-tune the service.

Following the success of ChatGPT, Microsoft dramatically upgraded the OpenAI infrastructure in 2023. ĬhatGPT initially used a Microsoft Azure supercomputing infrastructure, powered by Nvidia GPUs, that Microsoft built specifically for OpenAI and that reportedly cost "hundreds of millions of dollars". Proximal Policy Optimization algorithms are a cost-effective alternative to trust region policy optimization algorithms. These rankings were used to create "reward models" that the model was further fine-tuned on using several iterations of Proximal Policy Optimization (PPO). In the reinforcement learning step, human trainers first ranked responses that the model had created in a previous conversation. In the case of supervised learning, the model was provided with conversations in which the trainers played both sides: the user and the AI assistant. Both approaches use human trainers to improve the model's performance. The fine-tuning process leveraged both supervised learning as well as reinforcement learning in a process called reinforcement learning from human feedback (RLHF). It was fine-tuned (an approach to transfer learning ) over an improved version of OpenAI's GPT-3 known as " GPT 3.5". A version based on GPT-4, the newest OpenAI model, was released on March 14, 2023, and is available for paid subscribers on a limited basis.ĬhatGPT is a member of the generative pre-trained transformer (GPT) family of language models.

The original release of ChatGPT was based on GPT-3.5. Following the release of ChatGPT, OpenAI's valuation was estimated at US$29 billion in 2023. Its uneven factual accuracy, however, has been identified as a significant drawback. It garnered attention for its detailed responses and articulate answers across many domains of knowledge. It is built on top of OpenAI's GPT-3.5 and GPT-4 families of large language models (LLMs) and has been fine-tuned (an approach to transfer learning) using both supervised and reinforcement learning techniques.ĬhatGPT was launched as a prototype on November 30, 2022. ChatGPT is an artificial-intelligence (AI) chatbot developed by OpenAI and launched in November 2022.
