• Ben's Bites
  • Posts
  • Microsoft’s situationship with OpenAI

Microsoft’s situationship with OpenAI

PLUS: Datasets and Copyright.

Hello folks, here’s what we have today;

 Our picks 

1/

Microsoft plans AI service with Databricks that could hurt OpenAI. MSFT’s latest Azure+Databricks offer will allow Databricks users to use any AI model, including open-source LLMs, to train using their data on Azure—which might reduce the number of companies licensing OpenAI models for the same use case.

2/

Alex Reisner from The Atlantic did an analysis of Books3 - A dataset used to train Meta's Llama, BloombergGPT, and EleutherAI's GPT-J. He reveals it contains pirated versions of 170K+ books from Stephen King and other authors. Of the 170,000 titles, roughly one-third are fiction, and two-thirds are nonfiction. More than 30,000 titles are from Penguin Random House and 14,000 from HarperCollins.

3/

How to lose an AI copyright case? This lawyer says by saying that your AI tools are fully autonomous. If you say that your AI tool is fully autonomous and you want to copyright something generated by it, it is a fundamentally wrong case for copyright. (which is what happened in the recent case about a judge ruling that AI art can not be copyrighted)

4/

The Allen Institute of AI drops the biggest open dataset yet for training language models. DOLMA 3T is larger than Meta’s Llama 2’s 2T token dataset, with straightforward permissions to use.

 From the community 
 Cool Tools  trending product launches from the last 24 hours
  • GodMode - The AI chat browser. Fast, free access to ChatGPT, Bing, Bard, Claude, YouChat, Poe, Perplexity, Phind, and Local/GGML Models like Vicuna and Alpaca.

  • Poozle - open-source Plaid for LLMs.

  • Strada - Developer-first, enterprise integration platform.

  • Chapple - A one-stop AI-powered content creation tool.

  • ThoughtCast - Craft and share compelling audio pitches, blogs, and more.

  • Langfuse - Open source tracing and analytics for LLM applications.

  • Cerelyze - Turn technical research papers into usable code.

  • Vexis - Unbiased, accurate grading to free up time for educators to teach.

  • NeoGPT - Steerable AutoGPT where you & AI agents collaborate on complex tasks in real-time.

  • Recursive document agents from LlamaIndex - Ask and answer more questions over heterogeneous documents. (ps: I’m an investor)

 From the network 
 Ben’s Bites News  top posts from the last 24 hours

Ben’s Bites Insights

We have 2 databases that are updated daily which you can access by sharing Ben’s Bites using the link below;

  • All 10k+ links we’ve covered, easily filterable (1 referral)

  • 6k+ AI company funding rounds from Jan 2022, including investors, amounts, stage etc (3 referrals)

Reply

or to participate.