OpenAI’s Mira Murati is “not sure” where Sora’s training data comes from
The data source of OpenAI’s upcoming video-generating artificial intelligence model, Sora, is unclear to the company’s chief technology officer, Mira Murati.
During an interview with The Wall Street Journal published on March 13, Murati offered vague responses when asked about the source of data for the company’s Sora model, which is capable of generating videos from text instructions.
“We used publicly available data and licensed data,” Murati replied when asked how the company, valued at $80 billion, trained its upcoming model.
Joanna Stern, from the Journal, then asked whether Sora was trained with data from social media platforms, such as YouTube, Instagram, or Facebook. “I’m actually not sure about that,” Murati replied, adding:
“You know, if they were publicly available — publicly available to use. But I’m not sure. I’m not confident about it.”
Before moving to another topic, Stern mentioned OpenAI’s partnership with stock image company Shutterstock, asking if its data could be used to train Sora. “I’m just not going to go into detail about the data that was used. But it was publicly available or licensed data,” Murati added. Later, she confirmed to the Journal that Shutterstock data was used for Sora.
AI models are trained using large sets of data, known as training data sets, which help the model learn to recognize patterns, make predictions, or understand language.
OpenAI's CTO Mira Murati during an interview with The Wall Street Journal. Source: WSJ

Murati has been at OpenAI since 2018, leading some of the company’s most popular projects, including the image-generator model DALL-E 3, the speech-recognition tool Whisper and the latest version of the company’s chatbot GPT-4. In November 2023, she briefly took over as interim CEO after OpenAI’s board ousted Sam Altman.
OpenAI has been targeted by several legal actions involving its AI models’ training data. In July 2023, authors Sarah Silverman, Richard Kadrey, and Christopher Golden filed a lawsuit against the company, alleging that ChatGPT generates summaries of the authors’ works based on copyrighted content.
In December, The New York Times sued Microsoft and OpenAI in a similar copyright infringement complaint that alleges the companies used the newspaper’s content to train AI chatbots. A different class-action lawsuit was filed in California, alleging that OpenAI scraped private user information from the internet to train ChatGPT without user consent.