Editing image with ease
PLUS: "ChatGPT isn't that innovative" from Meta's Chief AI Scientist
Hey everyone. This is Ben's Bites. We're the Chipotle of AI: fast, fresh, and always on the cutting edge. Plus, we always give you a little extra.
I built an app demo that lets you analyse and qualify leads using Zapier and OpenAI. You can clone/remix it for yourself. Essentially, when someone fills out a form, AI will tell you whether it's important, neutral or unimportant. This could be used for a gazillion use cases. If you build anything with Zapier + OpenAI, ping me @bentossell on Twitter!
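For anyone who wants to try the same idea outside Zapier, here's a rough Python sketch of the "classify a form submission" step. It isn't necessarily how the demo works: the function names (`build_prompt`, `parse_label`), the prompt wording, and the fall-back-to-neutral rule are all assumptions for illustration. It only builds the prompt and tidies up the model's reply, so you'd send the prompt with whichever OpenAI call you prefer in between.

```python
from typing import Mapping

# The three labels mentioned above.
LABELS = ("important", "neutral", "unimportant")


def build_prompt(form: Mapping[str, str]) -> str:
    """Turn a form submission into a classification prompt (hypothetical wording)."""
    fields = "\n".join(f"{key}: {value}" for key, value in form.items())
    return (
        "Classify this lead as exactly one of: "
        + ", ".join(LABELS)
        + ".\n\n"
        + fields
        + "\n\nLabel:"
    )


def parse_label(completion: str) -> str:
    """Map free-text model output onto one label; assume 'neutral' if unclear."""
    text = completion.strip().lower()
    # Check "unimportant" first, because "important" is a substring of it.
    for label in ("unimportant", "important", "neutral"):
        if label in text:
            return label
    return "neutral"
```

In use, you'd call `build_prompt` on the form fields, send the result to the model, and pass the reply through `parse_label` before routing the lead.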
Let's get to it.
Have a product, service, job, event, newsletter, app, book, movie, tool, or anything you'd like to share with over 20k subscribers? Check out our sponsor options.
Ben's Picks
1/
If you find it as painfully uncomfortable to listen back to your own voice recordings as I do, then it's time for you to despair, because voice synthesis AI is starting to gain traction amongst investor circles. ElevenLabs received a $2m pre-seed for their company that produces realistic text-to-audio voice synthesis. They also extend existing recordings in audio-to-audio development, with long-term plans for this to work across languages. Yes, that's right, your pain might one day be available in every language.
It'll be pretty cool for the third stage in the big AI wave to begin with synthetic voice generation (the first being generative art, the second being chat-bots). With the accelerating popularity of audio-books, which are currently very expensive to produce, there could be some serious revenue in this realm with equally serious societal benefits.
And for a sneak peek at a real life pitch deck for a real life tech start-up with real life VC funding, this report includes the ElevenLabs one!
2/
The first bit of big gun shade has officially been thrown, with Meta's Chief AI Scientist Yann LeCun saying that ChatGPT isn't that innovative, that lots of labs already do it, and Meta could and will do so too. Sounds a little bit sour, and reminds me of the Big Short quote "being late is the same as being wrong". Yann, perhaps you're just a little bit late? (To be fair, he does kind of have a point...)
3/
Is Artificial Domain Intelligence the route to commercialising AGI? I've been enjoying the pieces of discussion that have been circulating on domain specialisation using generative AI models, and a Lead Data Scientist for FinCrime at KPMG summarises it well here.
4/
Playground AI announces AI-first image editing. Instruct an AI to synthesise spectacular yet subtle edits. We're seeing a lot of products implementing InstructPix2Pix - a method for editing images from human instructions, which I believe is what's being used here. (link)
Launched Today (Sponsor)
Gallery of Automated Artistry is giving away their first shirt #0001/1825 on Twitter - potentially owning a piece of history sounds pretty cool.
The first-ever AI-generated high-end fashion brand in collaboration with top creatives (1.5M followers)
Each design is serialized and only ever sold once.
They're giving away a piece of AI history, check out their first tweet. (link)
Cool Tools
Character AI - Generate realistic, intelligent, and interactive AI characters. (link)
AI Tools FYI - Find amazing AI tools that make your life easy! (link)
Stylized - Virtual product staging to create photos that sell. (link)
Mad Genius - Directory of 500+ AI tools updated daily. (link)
Algo - Conversational AI chatbot with less chatter and more control over personal data. (link)
Cover Letter AI - Analyse your resume and write a cover letter that highlights info relevant to the job description. (link)
Drayk.it - Make AI Drake songs about anything. (link)
Backend GPT - Using LLMs as backend. Scale AI hackathon's 1st place project. (link)
Rephraser - Text rephrasing tool using OpenAI's API. (link)
Magician by Diagram - A design tool for Figma powered by AI is now in public beta. (link)
Tome, the AI-powered storytelling format, has unveiled a multimodal vision for AI and released a new suite of generative storytelling features built for intuitive collaboration with AI, including text rewriting, length and tone adjustments, and generative prompt bar customisation. (link)
Miscellaneous
OpenAI and Microsoft extend partnership. (link)
Alexa & ChatGPT - A match made in AI heaven. (link)
The AI assist - A venture capitalist's experience with AI as he comes back to coding. (link)
Underfitted Talent Collective - A community of hand-curated AI professionals open to new opportunities. (link)
The human-AI partnership - ChatGPT and Reid Hoffman talk about how AI amplifies human potential. (link)
How Microsoft's stumbles led to its OpenAI alliance. (link)
The era of AI-first products. (link)
Learn
Too many links?! I created a database for all links mentioned in these emails. Refer 1 friend using this link and I'll send over the link database.
Research
H3 - Generative language modelling with only 2 attention layers. (link) One of the authors explains the model in this thread.
Green hierarchical vision transformer for masked image modelling. (link)
HexPlane: A fast representation for dynamic 3D scenes by 6 planes only. (link)
Summarise the past to predict the future - Natural language descriptions of context can forecast the next object interactions. (link)
BallGAN - 3D-aware image synthesis with a spherical background. (link)
Is ChatGPT a good translator? Results: yes for high-resource European languages, not so much for low-resource or distant languages. (link)
InfiniCity - Constructing and rendering an unconstrained, large and 3D-grounded environment from random noises. (paper, demo)
Zorro: The masked multimodal transformer. (link)
LEGO-Net: Learning regular rearrangements of objects in rooms. (link)
StyleGAN-T: Unlocking the power of GANs for fast, large scale text-to-image synthesis. (link)
Batch prompting for efficient inference with LLM APIs. (link)
Unclassifieds
Learn to build AI apps, even if you're not a developer. Create your own AI writer with GPT-3 and an image generator with Stable Diffusion.
Social Keyboard utilizes the "most useful" NLP products to help you better communicate with others. Available on iOS and Android.
Take notes, set reminders, and get AI generated summaries of your favorite newsletters with Apiary, an inbox built for newsletters. Join Now.
Who knows LangChain and GPTIndex cold and is interested in a paid tutoring gig?