Ben's Bites
Posts
SCOOP: AI Book Search

SCOOP: AI Book Search

🎬 AI-generated Marvel movies 🎥 Process, understand and search video 🤔 Simplifying AI terminology 📊 Generated data charts

Ben Tossell
November 15, 2022

Hey everyone, and welcome to the 121 new folk joining us since yesterday.

A few of you have asked, so I've created an AI job board + talent collective. If you're looking for AI job opportunities, sign up here (it's free) and I'll match you with companies hiring (you can remain anonymous too). If your company is hiring, you can post jobs and get access to the talent here. It's early so expect this to grow over time. Everything is vetted by me.

Let's get into it!

p.s. I keep forgetting to actually link the Discord if you're interested.

Prompt: List the topics in today’s email:

🎬 AI-generated Marvel movies🎥 Process, understand and search video🤔 Simplifying AI terminology📊 Generated data charts

🫦 BEN'S BITES

We got the scoop! Sahil Lavingia (founder of Gumroad) has launched Ask My Book. It’s being launched later today but he gave me the all-clear to announce it here first. Sahil wrote a book ‘The Minimalist Entrepreneur’ and has built this tool so that you can ask it any question and it will answer it in real-time, with a voice cloned from Sahil’s. It’s a really impressive way to see how AI tools combined can create a really helpful experience. He’s hosting a webinar on Friday that explains how he built it.

A demo of Game art made easy - Simple Stable Diffusion to Puppet: Stable diffusion -> concept art -> Automatic Puppet 2d Animation

Universal image segmentation, and how it can be used to improve performance across multiple image segmentation tasks. A truly universal framework should be trained only once and be able to achieve state-of-the-art performance. The OneFormer framework proposed in the article is a step towards making image segmentation more universal and accessible.

LumaAI is hiring for multiple roles!

👋 Too many links?! I created a database for all links mentioned in these emails. Refer 1 friend using this link and I'll send over the link database.

DreamArtist: With just one training image it learns the content and style in it, generating diverse high-quality images with high controllability.

A good podcast with Alex Wang of Scale AI on the state of AI, startup building, AI in defence + ethics and learning to think, the AI moats and more.

A tutorial on how Corridor Crew put Tom Holland into The Spider-Verse.

The next Google search engine will be Generative AI. The current design of search engines is based on outdated technology. AI could be used to create a new type of search engine that is more efficient. But, training a model is expensive.

A unified model for three motion and 3D perception tasks.

Any Roam lovers in the house? Use machine learning and graph theory to gain insights into your notes.

Sieve raised $4m and launched its AI API Beta. Sieve is a platform that makes it easy to process, understand, and search video, and it can be used to create interactive features without setting up a separate backend.

I wrote a little thread on simplifying AI terminology.

Create your own AI-powered glossary.

How fast can you speak English, without making mistakes? Take the Rap God Test (read a Wikipedia passage quickly) and get judged by OpenAI Whisper STT scoring your speech's intelligibility. Each error you make adds 1s to your time.

Create your own Avatar story with this Diffusion model.

Generating Images with OpenAI’s DALL-E via Data Fetcher and Airtable

Podcast version of Mario’s “What to Watch in AI” article.

Generated music from the most iconic album covers, using Img-to-Music.

Scott Belsky’s take on Generative AI: what disruptions are overblown, the perils of building start-ups (or features) on others’ models, and what will (and will never) change.

The State of Multilingual AI. How multilingual are current models in NLP, computer vision, and speech? What are the main recent contributions in this area? What challenges remain and how can we address them?

How AI will disrupt productivity tools. Admin structure, generative research, smart insights & suggestions, and task actions.

Does heavier use of AI assistance result in more vulnerable code? It seems so!

Chula is a tool that lets you type what your chart should show, it’ll find the data and create it.

A new method for pretraining audio representations by combining audio data with natural language descriptions.

An end-to-end locomotion system that is capable of traversing stairs, curbs, stepping stones, and gaps. The system is trained in simulation and transfers to the real world without any fine-tuning.

A new system for decoding visual stimuli from brain recordings, called MinD-Vis. The system is designed to improve upon existing methods by providing more faithful details and meaningful semantics in the reconstructed images.

A new text-to-image model called Paella requires less than 10 steps to generate high-fidelity images.

The use of score distillation to text-guide a NeRF model to generate a 3D object.

How generative AI is changing creative work.

An AI tool directory - Futurepedia.