Daily Digest: $10M for Math Geeks
PLUS: get your AI to behave like you want
Daily Digest #294
Hello folks, here’s what we have today:
PICKS
The AI-MO Prize is a new $10 million challenge fund to find an AI model that can win a gold medal at the International Mathematical Olympiad. It’s launched by XTX Markets to accelerate the development of AI models that can reason mathematically and solve problems at a level comparable to high-performing humans. 🍿Our Summary (also below)
Scale collaborates with NVIDIA on NeMo SteerLM - Scale AI is collaborating with NVIDIA to create the next generation of generative AI models using NVIDIA's SteerLM technique and Scale's high-quality training datasets. The goal is to help developers make apps more aligned towards their outcomes. 🍿Our Summary (also below)
from our sponsor
Is navigating complex data privacy regulations overwhelming?
Pyxos is your AI-driven data privacy & compliance partner. It’s not just another tool; it's an intelligent system that simplifies GDPR and CCPA complexities.
Empower your business with a seamless data compliance experience, foster responsible data usage, and accelerate innovation.
Enjoy your exclusive early access to Pyxos by clicking here.
TOP TOOLS
Magnific - The image upscaler & enhancer that feels like Magic.
Trash baby - Chaotic AI image mixing.
Fal - AI inference faster than you can type.
Excalidraw - Virtual whiteboard for sketching hand-drawn diagrams.
Sorcerer by Antimatter Systems - Asks great questions, differentiated for every learner.
AI Resume Checker by PDF AI - Use AI to roast/review your resume and receive feedback to improve it.
Alfred - Answer customer questions from your data, instantly.
NEWS
Reshaping the tree - rebuilding organizations for AI.
How Jensen Huang’s Nvidia is powering the AI revolution.
Fireside Chat with Raza Habib, CEO of Humanloop.
Self-Operating Computer - A framework to enable multimodal models to operate a computer.
Launch production-grade architectures using Pinecone and AWS.
Betaworks’s next camp program is all about agents. Must read if you’re building in that area.
QUICK BITES
The AI-MO Prize is a new $10 million challenge fund launched by XTX Markets to accelerate the development of AI models that can reason mathematically and solve problems at a level comparable to high-performing humans.
What is going on here?
The $10M AI-MO fund challenges AI to do math (and win a gold medal).
What does this mean?
The AI-MO fund will award progress prizes for models that achieve milestones, plus a $5 million grand prize for the first publicly shared model that wins a gold medal equivalent in an approved AI math competition.
The first approved competitions, with progress presentations at the 2024 International Mathematical Olympiad (IMO), will help compare different AI problem-solving strategies in a way accessible to the general public.
According to Fields medalist and IMO winner Terence Tao, people would be eager to know if AI can match the world’s brightest young mathematical minds. You bet we would.
Why should I care?
Reasoning and planning are two major weaknesses of autoregressive LLMs like the GPT models. IMO problems demand both at a high level: contestants get 4.5 hours for each set of three problems, an average of 90 minutes per question.
The AI-MO Prize connects AI progress to this benchmark of mathematical excellence. IMO gold medalists often go on to become leaders in science, engineering and technology, so an AI matching that achievement could profoundly impact these fields.
QUICK BITES
Scale AI is collaborating with NVIDIA to create the next generation of generative AI models using NVIDIA's NeMo SteerLM technique and Scale's high-quality training datasets.
What is going on here?
SteerLM from Nvidia and Scale AI will help developers make apps more aligned towards their outcomes.
What does this mean?
SteerLM comes with an open-source evaluation dataset of 37k samples, annotated along dimensions like helpfulness, correctness, complexity and verbosity.
SteerLM allows developers to dynamically customize model behaviour through easy-to-adjust attributes instead of full retraining. For example, in education applications, SteerLM can tailor model complexity and verbosity to individual learning needs. In gaming, SteerLM enables shaping NPC personality and emotional range.
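To make the "adjustable attributes" idea a bit more concrete, here is a minimal Python sketch. It is not NVIDIA's NeMo SteerLM API: the function name `build_steered_prompt`, the 0 to 4 scale and the `<attributes>` tag format are all assumptions made up for illustration. Only the attribute names come from the dataset dimensions mentioned above.

```python
# Hypothetical sketch of attribute-conditioned prompting.
# This is NOT the NeMo SteerLM API; names, scale and tag format are assumed.
ATTRIBUTES = ("helpfulness", "correctness", "complexity", "verbosity")
MIN_LEVEL, MAX_LEVEL = 0, 4  # assumed scale, for illustration only


def build_steered_prompt(user_message, **levels):
    """Prepend attribute targets to a user message so that an
    attribute-conditioned model could read them at inference time."""
    for name, value in levels.items():
        if name not in ATTRIBUTES:
            raise ValueError(f"unknown attribute: {name}")
        if not MIN_LEVEL <= value <= MAX_LEVEL:
            raise ValueError(f"{name} must be between {MIN_LEVEL} and {MAX_LEVEL}")
    # Fixed attribute order, so the same settings always give the same string.
    tags = ",".join(f"{name}:{levels[name]}" for name in ATTRIBUTES if name in levels)
    return f"<attributes>{tags}</attributes>\n{user_message}"


# Same question, two audiences: a beginner and an expert.
beginner = build_steered_prompt("Explain recursion.", complexity=1, verbosity=3)
expert = build_steered_prompt("Explain recursion.", complexity=4, verbosity=1)
print(beginner)
print(expert)
```

The point of the sketch is that the dials live in the request, not in the weights: changing `complexity` or `verbosity` means changing a number at inference time instead of retraining the model.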
Why should I care?
Making LLMs perform well in industry-specific scenarios is the target for many providers at the moment. Two of the major roadblocks to achieving that are a) access to industry-specific, custom-labelled, high-quality data and b) ready-made frameworks with options to crank certain behaviours up or down.
Scale and Nvidia’s partnership is one solution for both of these. I’d bet on more such options coming out of LLM fine-tuning and deployment companies in the coming months.
Ben’s Bites Insights
We have 2 databases, updated daily, which you can access by sharing Ben’s Bites using the link below:
All 10k+ links we’ve covered, easily filterable (1 referral)
6k+ AI company funding rounds from Jan 2022, including investors, amounts, stage etc (3 referrals)