Editing image with ease

PLUS: "ChatGPT isn't that innovative" from Meta's Chief AI Scientist

Hey everyone. This is Ben's Bites, We're the Chipotle of AI: fast, fresh, and always on the cutting-edge. Plus we always give you a little extra šŸ˜‰Ā 

I built an app demo that lets you analyse and qualify leadsĀ using Zapier and OpenAI. You can clone/remix it for yourself. Essentially when someone fills out a form, AI will tell you whether its important, neutral or unimportant. This could be used for a gazillion use cases. If you build anything with Zapier + OpenAI, ping me @bentossell on Twitter!

Let's get to it.

Have a product, service, job, event, newsletter, app, book, movie, tool, or anything you'd like to share with over 20k subscribers?Ā Check out our sponsor options.

šŸ¤Œ Ben's Picks

1/

If you find it as painfully uncomfortable to listen back to your own voice recordings as I do, then itā€™s time for you to despair, because voice synthesis AI is starting to gain traction amongst investor circles. ElevenLabs received a $2m pre-seed for their company that produces realistic text-to-audio voice synthesis. They also extend existing recordings in audio-to-audio development, with long-term plans for this to work across languages. Yes, thatā€™s right, your pain might one day be available in every language.

Itā€™ll be pretty cool for the third stage in the big AI wave to begin with synthetic voice generation (the first being generative art, the second being chat-bots). With the accelerating popularity of audio-books, which are currently very expensive to produce, there could be some serious revenue in this realm with equally serious societal benefits.

And for a sneak peek at a real life pitch deck for a real life tech start-up with real life VC funding, this report includes the ElevenLabs one!

2/

The first bit of big gun shade has officially been thrown, with Metaā€™s Chief AI Scientist Yann LeCun saying that ChatGPT isnā€™t that innovative, that lots of labs already do it, and Meta could and will do so too. Sounds a little bit sour, and reminds me of the Big Short quote ā€œbeing late is the same as being wrongā€. Yann, perhaps youā€™re just a little bit late? (To be fair, he does kind of have a pointā€¦)

3/

Is Artificial Domain Intelligence the route to commercialising AGI? Iā€™ve been enjoying the pieces of discussion that have been circulating on domain specialisation using generative AI models, and Lead Data Scientist for FinCrime at KPMG summarises it well here.

4/

Playground AI announces AI-first image editing. Instruct an AI to synthesise spectacular yet subtle edits. We're seeing a lot of products implementing InstructPix2Pix - a method for editing images from human instructions, which I believe is what's being used here. (link)

šŸ”„ Launched TodayĀ šŸ”„Ā (Sponsor)

Gallery of Automated Artistry is giving away their first shirt #0001/1825 on Twitter - potentially owning a piece of history sounds pretty cool.

The first-ever AI-generated high-end fashion brand in collaboration with top creatives (1.5M followers) šŸ–¼ļø

Each design is serialized and only ever sold once.

They're giving away a piece of AI history, check out their first tweet. (link)

šŸ› ļø Cool Tools

  • Character AI - Generate realistic, intelligent, and interactive AI characters. (link)

  • AI Tools FYI - Find amazing AI tools that make your life easy! (link)

  • Stylized - Virtual product staging to create photos that sell. (link)

  • Mad Genius - Directory of 500+ AI tools updated daily. (link)

  • Algo - Conversational AI chatbot with less chatter and more control over personal data. (link)

  • Cover Letter AI - Analyse your resume and write a cover letter that highlights info relevant to the job description. (link)

  • Drayk.it - Make AI drake songs about anything. (link)

  • Backend GPT - Using LLMs as backend. Scale AI hackathonā€™s 1st place project. (link)

  • Rephraser - Text rephrasing tool using OpenAI's API. (link)

  • Magician by Diagram - A design tool for Figma powered by AI is now in public beta. (link)

  • Tome, the AI-powered storytelling format, has unveiled a multimodal vision for AI and released a new suite of generative storytelling features built for intuitive collaboration with AI, including text rewriting, length and tone adjustments, and generative prompt bar customisation. (link)

šŸ¤“ Miscellaneous

  • OpenAI and Microsoft extend partnership. (link)

  • Alexa & ChatGPT - A match made in AI heaven. (link)

  • The AI assist - A venture capitalists experience with AI as he comes back to coding. (link)

  • Underfitted Talent Collective - A community of hand-curated AI professionals open to new opportunities. (link)

  • The human-AI partnership - ChatGPT and Reid Hoffman talk about how AI amplifies human potential. (link)

  • How Microsoftā€™s stumbles led to its OpenAI alliance. (link)

  • The era of AI-first products. (link)

šŸŽ“ Learn

  • How ChatGPT actually works? (link)

  • Build a game using ChatGPT and Replit. (link)

šŸ‘‹ Too many links?! I created a database for all links mentioned in these emails. Refer 1 friend using this link and I'll send over the link database.

šŸ”¬ Research

  • H3 - Generative language modelling with only 2 attention layers. (link) One of the authors explains the model in this thread.

  • Green hierarchical vision transformer for masked image modelling. (link)

  • HexPlane: A fast representation for dynamic 3D scenes by 6 planes only. (link)

  • Summarise the past to predict the future - Natural language descriptions of context can forecast the next object interactions. (link)

  • BallGAN - 3D-aware image synthesis with a spherical background. (link)

  • Is ChatGPT a good translator? Results: yes for high-resource European languages, not much for low resource or distant languages. (link)

  • InfiniCity - Constructing and rendering an unconstrained, large and 3D-grounded environment from random noises. (paper, demo)

  • Zorro: The masked multimodal transformer. (link)

  • LEGO-Net: Learning regular rearrangements of objects in rooms. (link)

  • StyleGAN-T: Unlocking the power of GANs for fast, large scale text-to-image synthesis. (link)

  • Batch prompting for efficient inference with LLM APIs. (link)

šŸ“° Unclassifieds

Have a product, service, job, event, newsletter, app, book, movie, tool, or anything you'd like to share with over 20k subscribers?Ā Check out our sponsor options.

šŸ–¼ AI images of the day

šŸ¤— Share Ben's Bites

Send this to 1 AI-curious friend and receive my AI project tracker database!

or copy/paste this link: https://bensbites.beehiiv.com/subscribe?ref=PLACEHOLDER

ā­ļø How did we do?

How was today's email?

Login or Subscribe to participate in polls.

ā­ļø REAL REVIEWS

Join the conversation

or to participate.