AI boom
The AI boom refers to a period of rapid and unprecedented development in the field of artificial intelligence, with the generative AI race being a key component of this boom, which began in earnest with the founding of OpenAI in 2016 or 2017.[1]
OpenAI's generative AI systems, such as GPT (2018) and DALL-E (2021), have played a significant role in driving this development.[2][3][4] In 2022, large language models were improved to where they could be used for chatbot applications; text-to-image-models were at a point where they were almost indiscernible from human-made imagery;[5] and speech synthesis software was able to replicate human speech efficiently.[6]
Language models
GPT-3 is a large language model that was released in 2020 by OpenAI and is capable of generating high-quality human-like text that can be hard to determine whether it was hard to be written by a human or not.[7] An upgraded version called GPT-3.5 was used in ChatGPT, which later garnered attention for its detailed responses and articulate answers across many domains of knowledge.[8] A new version called GPT-4 was released on March 2023 and was used in the Microsoft Bing search engine.[9][10] Other language models have been released such as LaMDA by Google and LLaMA by Meta Platforms.
In January 2023, DeepL Write, an AI-based tool to improve monolingual texts, was released.[11]
Text-to-image models
One of the first text-to-image models to capture widespread public attention was OpenAI's DALL-E, a transformer system announced in January 2021.[12] A successor capable of generating more complex and realistic images, DALL-E 2, was unveiled in April 2022,[13] followed by Stable Diffusion, an open source alternative, releasing in August 2022.[14]
Following other text-to-image models, language model-powered text-to-video platforms such as DAMO,[15] Make-A-Video,[16] Imagen Video[17] and Phenaki[18] can generate video from text and/or text/image prompts.[19]
Speech synthesis
15.ai was one of the first publicly available speech synthesis software that allowed people to generate natural emotive high-fidelity text-to-speech voices from an assortment of fictional characters from a variety of media sources. It was first released on March 2020.[20][21] ElevenLabs unveiled a website where users are able to upload voice samples to that allowed it to generate voices from them. The company was criticized after users were able to abuse its software to generate controversial statements in the vocal style of celebrities, public officials, and other famous individuals[22] and raised concerns that it could be used to generate deepfakes that were more convincing.[23]
References
- "Why am I not terrified of AI?". Shtetl-Optimized. 2023-03-06. Retrieved 2023-03-19.
- Newman, Daniel. "Exploring The Ins And Outs Of The Generative AI Boom". Forbes. Retrieved 2023-03-14.
- "The AI boom: lessons from history | The Economist". The Economist. 2023-03-13. Archived from the original on 2023-03-13. Retrieved 2023-03-15.
- Kafka, Peter (2023-02-01). "The AI boom is here, and so are the lawsuits". Vox. Retrieved 2023-03-15.
- Vincent, James (2022-05-24). "All these images were generated by Google's latest text-to-image AI". The Verge. Retrieved 2023-03-15.
- "AI-Generated Voice Firm Clamps Down After 4chan Makes Celebrity Voices for Abuse". www.vice.com. Retrieved 2023-03-15.
- Sagar, Ram (2020-06-03). "OpenAI Releases GPT-3, The Largest Model So Far". Analytics India Magazine. Retrieved 2023-03-15.
- Lock, Samantha (2022-12-05). "What is AI chatbot phenomenon ChatGPT and could it replace humans?". The Guardian. ISSN 0261-3077. Retrieved 2023-03-15.
- Lardinois, Frederic (2023-03-14). "Microsoft's new Bing was using GPT-4 all along". TechCrunch. Retrieved 2023-03-15.
- "OpenAI announces ChatGPT successor GPT-4". BBC News. 2023-03-14. Retrieved 2023-03-15.
- Ziegener, Daniel (17 January 2023). "DeepL Write: Brauchen wir jetzt noch eine menschliche Lektorin?". Golem.de.
- Coldewey, Devin (2021-01-05). "OpenAI's DALL-E creates plausible images of literally anything you ask it to". TechCrunch. Retrieved 2023-03-15.
- Coldewey, Devin (2022-04-06). "New OpenAI tool draws anything, bigger and better than ever". TechCrunch. Retrieved 2023-03-15.
- "Stable Diffusion Public Release". Stability AI. Retrieved 2023-03-15.
- "ModelScope 魔搭社区". modelscope.cn. Retrieved 2023-03-20.
- kumar, Ashish (2022-10-03). "Meta AI Introduces 'Make-A-Video': An Artificial Intelligence System That Generates Videos From Text". MarkTechPost. Retrieved 2023-03-15.
- Edwards, Benj (2022-10-05). "Google's newest AI generator creates HD video from text prompts". Ars Technica. Retrieved 2022-10-25.
- "Phenaki". phenaki.video. Retrieved 2022-10-03.
- Edwards, Benj (9 September 2022). "Runway teases AI-powered text-to-video editing using written prompts". Ars Technica. Retrieved 12 September 2022.
- Zwiezen, Zack (2021-01-18). "Website Lets You Make GLaDOS Say Whatever You Want". Kotaku. Kotaku. Archived from the original on 2021-01-17. Retrieved 2021-01-18.
- Ruppert, Liana (2021-01-18). "Make Portal's GLaDOS And Other Beloved Characters Say The Weirdest Things With This App". Game Informer. Game Informer. Archived from the original on 2021-01-18. Retrieved 2021-01-18.
- Jorge Jimenez (January 31, 2023). "AI company promises changes after 'voice cloning' tool used to make celebrities say awful things". PC Gamer. Retrieved 2023-02-03.
- https://news.yahoo.com/seeing-believing-global-scramble-tackle-013429757.html.
{{cite web}}
: Missing or empty|title=
(help)