AI Minds Newsletter
Posts
Jensen Huang talks Elon Musk’s Colossus, Mark Rober Judges Robot Chefs, and Why Ben Affleck’s bearish on LLM screenwriters

Jensen Huang talks Elon Musk’s Colossus, Mark Rober Judges Robot Chefs, and Why Ben Affleck’s bearish on LLM screenwriters

Mark Rober and Nick DiGiovanni collab on a robot cooking contest, Jensen Huang discusses Elon Musk's new Colossus supercomputer, and why Ben Affleck thinks LLMs don't stand a chance against human screenwriters and actors.

Jose Nicholas Francisco
November 19, 2024

Welcome (back) to AI Minds, a newsletter about the brainy and sometimes zany world of AI, brought to you by the Deepgram editorial team.

In this edition:

🎥 Mark Rober Judges a Robot’s Cooking against Nick DiGiovanni’s
💉 New Multimodal GenAI Copilot for Human Pathology
🎧 Latest, Cutting-Edge Advancements in Multimodal AI and Semantic Communications
🚓 TechCrunch: How a software engineer launched a police AI startup
🐦 Social Media Buzz: Jensen Huang’s thoughts on Elon Musk’s Colossus
🦠 Viral breakdown of DeepMind’s Med-PaLM M, a multimodal GenAI for medicine
💻 New Webinar on Voice AI solutions with Applied Engineer Brent George!
🚑 Why Medical Transcription is Hard for Humans and Machines (and how to fix it)
🎤 AI Minds Podcast with Pallavi Gadepalli, Founder & CEO of Enterprise Chai!
📽️ Why Ben Affleck thinks AI doesn’t stand a chance against actors or Shakespeare
⚒️ How to build the future: A Guide by Sam Altman
💾A deep-dive into Data Drift and Machine Learning
🚀 Why Agentic AI is such a big deal

Thanks for letting us crash your inbox; let’s party. 🎉

Deepgram released a brand new medical transcription model! Check it out here. 🥳

🎥 Mark Rober Judges a Robot’s Cooking against Nick DiGiovanni’s

Whether fully automated or controlled remotely via a VR headset, robots can do some amazing things. But how do they compare to humans when it comes to creative-yet-repetitive tasks? Find out in this video!

🧑‍🔬 Multimodal GenAI for Pathology and the Latest Advancement in Multimodal Semantic Communications

A multimodal generative AI copilot for human pathology - Despite the explosive growth of Generative AI, there have been few studies on building general-purpose multimodal AI assistants and copilots tailored to pathology. This paper therefore presents PathChat, a vision-language generalist AI assistant for human pathology.

Large AI Model Empowered Multimodal Semantic Communications - Multimodal signals, including text, audio, image, and video, can be integrated into Semantic Communication (SC) systems to provide an immersive experience with low latency and high quality at the semantic level. However, the multimodal SC has several challenges, including data heterogeneity, semantic ambiguity, and signal distortion during transmission. This paper proposes a unique solution to those problems called the “LAM-MSC framework.” Check it out!

From Elon Musk to cop car chases, how a software engineer launched a police AI startup tcrn.ch/4h59yFD
— TechCrunch (@TechCrunch)
2:02 PM • Oct 17, 2024

Jensen Huang: Elon is superhuman. What he and the xAI team achieved is unbelievable.
NVIDIA's CEO reflects on Elon Musk and the xAI team building Colossus, the world's fastest supercomputer, in just 19 days. He says any other company would take a full year.
“Building a massive… x.com/i/web/status/1…
— ELON DOCS (@elon_docs)
6:53 PM • Oct 28, 2024

Incredible news. The first Generalist Medical AI system is out.
DeepMind just announced Med-PaLM M, a Multimodal Generative AI model that understands:
1. Clinical language
2. Imaging
3. Genomics
The model reaches or surpasses SOTA on 14 different tasks all with the same set of… x.com/i/web/status/1…
— Lior⚡ (@LiorOnAI)
5:33 PM • Jul 30, 2023

🏇 New Webinar on Voice AI Solutions with Applied Engineer Brent George

Voice AI adoption is surging, but balancing security and compliance is critical, with 83% of enterprises moving workloads to private clouds for greater control. Join our webinar on November 19th to explore the advantages of on-premises Voice AI solutions and how to safeguard sensitive voice data and ensure compliance, without compromising real-time performance. Reserve your spot today!

When: Tuesday, November 19th at 10AM PT | 12 PM CT | 1 PM ET

Where: Online

Why Medical Transcription is Hard for Humans and Machines - Building artificial intelligence (AI) models that approach human medical transcriptionists' precision has been a long slog, but machines are gradually improving. Here’s why they’ll surpass human performance very soon.

🎤 The AI Minds Podcast!

Pallavi Gadepalli, Founder & CEO of Enterprise Chai, joins the AI Minds Podcast to discuss her journey from software developer to customer success visionary. With two decades of experience at Microsoft, eBay, ServiceNow, and Cisco, Pallavi has pioneered AI-driven solutions that transform real-time customer engagement.

🤖 Bonus Bits and Bytes!

If you’ve scrolled down this far, here are some bonus features for you!

🎥 Ben Affleck says AI doesn't stand a chance against actors or Shakespeare - On CNBC, Academy Award Winner Ben Affleck breaks down why movies will be the last thing replaced by AI. Do you think he’s right?
⚒️ How to Build the Future: Sam Altman - “In his latest essay Altman predicted that ASI (Artificial Super Intelligence) is just a few thousand days away. So how did we get to this point? Find out here!”
💾A deep-dive into Data Drift and Machine Learning: This glossary entry reveals the ins and outs of data drift, its significance in the machine learning landscape, and its distinguishable features from concept drift.
🚀 Why Agentic AI is such a big deal - A single trip to a conference inspired ML Researcher Jose Francisco to dive into the world of Agentic AI. Here’s what he uncovered.