The first GPT model was known as "GPT-1," and it was followed by "GPT-2" in February 2019. Created as a direct scale-up of its predecessor, GPT-2 had both its parameter count and dataset size increased by a factor of 10. It had 1.5 billion parameters, and was trained on a dataset of 8 million web pages. [9]
OpenAI has released significant GPT foundation models that have been sequentially numbered, to comprise its "GPT-n" series. [10] Each of these was significantly more capable than the previous, due to increased size (number of trainable parameters) and training. The most recent of these, GPT-4o, was released in May 2024. [11]
A fine-tuned variant of GPT-3, termed GPT-3.5, was made available to the public through a web interface called ChatGPT in 2022. [165]
GPT-Neo (March 2021, EleutherAI): 2.7 billion parameters [166]; 825 GiB corpus [167]; MIT license [168]. The first of a series of free GPT-3 alternatives released by EleutherAI. GPT-Neo outperformed an equivalent-size GPT-3 model on some benchmarks ...
OpenAI's GPT-4 model was released on March 14, 2023. Observers saw it as an impressive improvement over GPT-3.5, with the caveat that GPT-4 retained many of the same problems. [96] Some of GPT-4's improvements were predicted by OpenAI before training it, while others remained hard to predict due to breaks [97] in downstream scaling laws.
On July 18, 2024, OpenAI released GPT-4o mini, a smaller version of GPT-4o replacing GPT-3.5 Turbo on the ChatGPT interface. Its API costs $0.15 per million input tokens and $0.60 per million output tokens, compared to $5 and $15 respectively for GPT-4o.
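To make the price gap concrete, the per-million-token rates quoted above can be turned into a rough cost estimate. The sketch below is illustrative only: the dictionary keys, the function name `estimate_cost`, and the token counts are assumptions made for this example, and the prices are simply the figures from the snippet, which may no longer be current.

```python
# Prices in USD per million tokens, copied from the snippet above.
# The model keys are illustrative labels, not verified API identifiers.
PRICES_PER_MILLION = {
    "gpt-4o-mini": {"input": 0.15, "output": 0.60},
    "gpt-4o": {"input": 5.00, "output": 15.00},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload (hypothetical helper)."""
    price = PRICES_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: one million input tokens and one million output tokens.
    for name in PRICES_PER_MILLION:
        print(f"{name}: ${estimate_cost(name, 1_000_000, 1_000_000):.2f}")
```

For that workload the quoted rates come to about $0.75 for GPT-4o mini against $20 for GPT-4o.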
Generative AI systems trained on words or word tokens include GPT-3, GPT-4, GPT-4o, LaMDA, LLaMA, BLOOM, Gemini and others (see List of large language models). They are capable of natural language processing, machine translation, and natural language generation and can be used as foundation models for other tasks. [62]
OpenAI's chatbot, ChatGPT, released in November 2022, is based on the GPT-3.5 and GPT-4 large language models. Celeste Biever wrote in a Nature article that "ChatGPT broke the Turing test". [24]
A new version called GPT-4 was released on March 14, 2023, and was used in the Microsoft Bing search engine. [46] [47] Other language models have been released, such as PaLM and Gemini by Google and LLaMA by Meta Platforms. In January 2023, DeepL Write, an AI-based tool to improve monolingual texts, was released. [48]