SOTA Speech and Text Translation

PLUS: GPT 3.5 turbo finetuning and open source vision-language model.

Hello folks, here’s what we have today;

Our picks

1/

SeamlessM4T - The first, all-in-one, multimodal translation model from Meta. The model supports nearly 100 languages for input (speech + text), 100 languages for text output and 35 languages (plus English) for speech output. It outperforms Whisper on the benchmarks and is available for download on Github.

2/

GPT-3.5 Turbo fine-tuning and API updates. Open AI has finally made fine-tuning with GPT 3.5 Turbo available, with the GPT-4 fine-tuning coming in a few months. Plus, OpenAI is changing the API endpoint for old GPT-3 models with the previous endpoint deprecating on January 4th, 2024.

3/

IDEFICS - An open-access reproduction of Flamingo. Flamingo is the SOTA vision-language model from Deepmind. The team at Hugging Face has made a replication of it using Llama v1 and CLIP. IDEFICS is comparable in performance with the original closed-source Flamingo model across various image-text understanding benchmarks.

4/

Visualising AI - A unique project from Deepmind with 13 artists coming together to create artworks about AI.

from our sponsor

Take a demo, get a $150 Nike gift card.

Running out of patience for manual expense reports? Divvy can help you speed through expense reporting (like Nikes can help you speed through your workout). Take a demo to learn more, and we’ll send you a $150 Nike gift card.*

In just 20 minutes, you’ll discover how to:

  • Automate expense reports

  • Take control of your budget and spending

  • Access scalable business credit to grow your business

Claim your offer
*Terms and conditions apply: see offer page for more details

 From the community 
 Cool Tools  trending product launches from the last 24 hours
  • Accountabilabuddy by Summit - Your AI buddy that sends a simple text to motivate and hold you accountable.

  • Gamma 1.0 - AI text, visuals, and now images together in one tool for presentations.

  • AI office - Starter kit for building your own version of AI town.

  • GuestLab - Hours of guest research delivered in seconds.

  • Homation - Easy way to create a smart home with constructor.

  • Rainbow AI - Precise precipitation and rain forecasting.

  • Laxis - Repurpose audio into engaging content with a single click.

  • SEC Insights AI - Revolutionizing SEC document analysis.

  • Ollama - The easiest way to run LLMs locally.

  • ElevenLabs - The voice-generating tools launch out of beta.

 From the network 
 Ben’s Bites News  top posts from the last 24 hours

Ben’s Bites Insights

We have 2 databases that are updated daily which you can access by sharing Ben’s Bites using the link below;

  • All 10k+ links we’ve covered, easily filterable (1 referral)

  • 6k+ AI company funding rounds from Jan 2022, including investors, amounts, stage etc (3 referrals)

Join the conversation

or to participate.