Wednesday, October 2, 2024

OpenAI inks deal to coach AI on Reddit information

OpenAI has reached a deal with Reddit to make use of the social information website’s information for coaching AI fashions.

In a weblog publish on OpenAI’s press relations website, the corporate mentioned that the Reddit partnership will present it entry to “real-time, structured and distinctive content material” — e.g. posts and replies — from Reddit, permitting its instruments and fashions to “higher perceive and showcase” that content material. Reddit content material shall be integrated into ChatGPT, OpenAI’s standard conversational AI, and the businesses will work collectively to convey unspecified new “AI-powered options” to each Reddit customers and moderators.

OpenAI may also grow to be a Reddit promoting companion.

“Reddit shall be constructing on OpenAI’s platform of AI fashions to convey its highly effective imaginative and prescient to life,” OpenAI wrote within the publish. “Utilizing LLMs, ML, and AI enable Reddit to enhance the consumer expertise for everybody.”

OpenAI has a number of comparable licensing offers with content material suppliers starting from inventory media libraries to information publishers. However the uncommon angle to this one is that Sam Altman, OpenAI’s CEO, has an 8.7% stake in Reddit, making him the third-largest shareholder, and was as soon as a member of the corporate’s board of administrators.

In an try to discourage scrutiny, OpenAI says in its press launch that, whereas Altman stays a Reddit shareholder, the partnership “was led by OpenAI’s COO [Brad Lightcap]” and “accepted by [OpenAI’s] impartial board of administrators.” (I’ll notice right here that Altman is a member of OpenAI’s board; he recused himself for this resolution, nonetheless, an OpenAI spokesperson tells TechCrunch.)

Reddit has made information licensing agreements an more and more central a part of its development technique because it navigates the market as a public firm.

In its IPO prospectus, Reddit revealed that it has contractual agreements to license its information to prospects together with Google price a mixed over $200 million. And, in its first earnings report as a public firm, Reddit reported a 450% year-over-year enhance in non-ad income, attributable primarily to these agreements.

Reddit inventory was up 11% in prolonged buying and selling following the announcement of the OpenAI deal.

“The paradox I see is that, as extra content material on the web is written by machines, there’s an growing premium on content material that comes from actual folks,” Reddit CEO Steve Huffman mentioned through the firm’s earnings name in March. “And we’ve almost 20 years of genuine dialog.”

Reddit’s platform — which has over 1 billion posts and greater than 16 billion feedback, figures that develop day-after-day due to its a whole lot of thousands and thousands of energetic customers — is a gold mine for generative AI corporations, whose fashions study from examples of content material, like textual content and pictures, to generate new, comparable content material.

However the firm might face pushback from customers involved about the way it’s monetizing their information.

It’s instructive to have a look at Stack Overflow, the Q&A discussion board for software program builders, which not too long ago inked an settlement with OpenAI to provide information for the latter’s mannequin coaching. In protest, some customers deleted their top-rated solutions to questions on the group. However Stack Overflow restored the deleted posts and banned these customers, claiming that they weren’t in compliance with its phrases of service.

Reddit has already voiced its displeasure with one try to afford Reddit customers better management over their very own information.

Vana, a startup constructed on the blockchain, is trying to launch a knowledge “DAO” (Digital Autonomous Group) to let Reddit customers pool their information and allow them to determine collectively how that mixed information’s used (or offered). Reddit banned Vana’s subreddit devoted to dialogue concerning the DAO, in an announcement to TechCrunch, and accused the corporate of “exploiting” its information export controls.

We’re launching an AI e-newsletter! Join right here to begin receiving it in your inboxes on June 5.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles