• AI Minds Newsletter
  • Posts
  • Jensen Huang talks Elon Musk’s Colossus, Mark Rober Judges Robot Chefs, and Why Ben Affleck’s bearish on LLM screenwriters

Jensen Huang talks Elon Musk’s Colossus, Mark Rober Judges Robot Chefs, and Why Ben Affleck’s bearish on LLM screenwriters

Mark Rober and Nick DiGiovanni collab on a robot cooking contest, Jensen Huang discusses Elon Musk's new Colossus supercomputer, and why Ben Affleck thinks LLMs don't stand a chance against human screenwriters and actors.

Welcome (back) to AI Minds, a newsletter about the brainy and sometimes zany world of AI, brought to you by the Deepgram editorial team.

In this edition:

  • 🎥  Mark Rober Judges a Robot’s Cooking against Nick DiGiovanni’s

  • 💉 New Multimodal GenAI Copilot for Human Pathology

  • 🎧 Latest, Cutting-Edge Advancements in Multimodal AI and Semantic Communications

  • 🚓 TechCrunch: How a software engineer launched a police AI startup

  • 🐦 Social Media Buzz: Jensen Huang’s thoughts on Elon Musk’s Colossus

  • 🦠 Viral breakdown of DeepMind’s Med-PaLM M, a multimodal GenAI for medicine

  • 💻 New Webinar on Voice AI solutions with Applied Engineer Brent George!

  • 🚑 Why Medical Transcription is Hard for Humans and Machines (and how to fix it)

  • 🎤 AI Minds Podcast with Pallavi Gadepalli, Founder & CEO of Enterprise Chai!

  • 📽️ Why Ben Affleck thinks AI doesn’t stand a chance against actors or Shakespeare

  • ⚒️ How to build the future: A Guide by Sam Altman

  • 💾A deep-dive into Data Drift and Machine Learning

  • 🚀 Why Agentic AI is such a big deal

Thanks for letting us crash your inbox; let’s party. 🎉

Deepgram released a brand new medical transcription model! Check it out here. 🥳

🎥  Mark Rober Judges a Robot’s Cooking against Nick DiGiovanni’s

Whether fully automated or controlled remotely via a VR headset, robots can do some amazing things. But how do they compare to humans when it comes to creative-yet-repetitive tasks? Find out in this video!

🧑‍🔬  Multimodal GenAI for Pathology and the Latest Advancement in Multimodal Semantic Communications

A multimodal generative AI copilot for human pathology - Despite the explosive growth of Generative AI, there have been few studies on building general-purpose multimodal AI assistants and copilots tailored to pathology. This paper therefore presents PathChat, a vision-language generalist AI assistant for human pathology.

Large AI Model Empowered Multimodal Semantic Communications - Multimodal signals, including text, audio, image, and video, can be integrated into Semantic Communication (SC) systems to provide an immersive experience with low latency and high quality at the semantic level. However, the multimodal SC has several challenges, including data heterogeneity, semantic ambiguity, and signal distortion during transmission. This paper proposes a unique solution to those problems called the “LAM-MSC framework.” Check it out!

🐝 Social Media Buzz: Jensen Huang’s thoughts on Elon Musk’s Colossus

🏇 New Webinar on Voice AI Solutions with Applied Engineer Brent George

Voice AI adoption is surging, but balancing security and compliance is critical, with 83% of enterprises moving workloads to private clouds for greater control. Join our webinar on November 19th to explore the advantages of on-premises Voice AI solutions and how to safeguard sensitive voice data and ensure compliance, without compromising real-time performance. Reserve your spot today!

When: Tuesday, November 19th at 10AM PT | 12 PM CT | 1 PM ET

Where: Online

Why Medical Transcription is Hard for Humans and Machines - Building artificial intelligence (AI) models that approach human medical transcriptionists' precision has been a long slog, but machines are gradually improving. Here’s why they’ll surpass human performance very soon.

🎤 The AI Minds Podcast!

Pallavi Gadepalli, Founder & CEO of Enterprise Chai, joins the AI Minds Podcast to discuss her journey from software developer to customer success visionary. With two decades of experience at Microsoft, eBay, ServiceNow, and Cisco, Pallavi has pioneered AI-driven solutions that transform real-time customer engagement.

🤖 Bonus Bits and Bytes!

If you’ve scrolled down this far, here are some bonus features for you!