Loading...
Loading...
0 / 10 episodes
No episodes yet
Tap + Later on any episode to add it here.
Jon Krohn
My guest today took a public school that was about to be shut down and turned it into the number one school in Boston, and AI is her latest secret weapon. In a long-overdue episode on AI for supporting children’s education, hear directly from Principal Traci Walker Griffith how her teachers have been experimenting with AI in classrooms, what works, what doesn’t work, and what’s next for kids as LLMs continue to improve. Additional materials: www.superdatascience.com/983 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:38) The Eliot School’s transformation from closure list to number one in Boston (08:54) How the school began using Claude for AI-assisted writing feedback (18:14) How younger students benefit from AI behind the scenes (23:46) How older students interact with AI directly (41:11) Three prompt engineering failure modes and how to fix them (55:29) Responding to the Brookings report on AI risks in education
Jon Krohn rounds up March’s interviews in this ICYMI episode. Hear from AI and data science experts across the fields of education and business in this wide-ranging series of clips that take listeners from the Renaissance to the near future. Guests include Lin Quiao (Episode 971), Chris Fregly (Episode 973), Zack Kass (Episode 975), Kyunghyun Cho (Episode 977), and Rohit Choudhary (Episode 979). Additional materials: www.superdatascience.com/982 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Matt Glickman talks to Jon Krohn about co-founding the agentic-platform startup, Genesis Computing, how his experience at Goldman Sachs paved the way for developing AI agents, and where he thinks agentic AI has just as much value as a company’s human employees. This February, Genesis Computing revealed how its platform can offer the guardrails so crucial to businesses, alongside increased capabilities that help execute entire workflows from research to deployment. Additional materials: www.superdatascience.com/981 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (12:56) Cloud adoption in finance and healthcare (18:28) How Genesis Computing uses AI agents (31:05) AI agents replacing humans in the workplace (56:25) An argument for encouraging enterprises to use AI
A team of theoretical physicists from Harvard, Cambridge, the Institute for Advanced Study, and Vanderbilt used OpenAI’s models not just as a tool, but as a collaborator, cracking a problem in particle physics that had stymied them for months. In this Five-Minute Friday, Jon Krohn walks through how GPT-5.2 Pro simplified a 32-variable mathematical expression into a single line, proposed what it called the “obvious generalization” for any number of gluons, and how a more powerful internal model then produced a formal proof after 12 hours of autonomous reasoning. Find out why this may be a template for AI-assisted scientific discovery and what it means for the future of research. Additional materials: www.superdatascience.com/980 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
For years, Jon has been quoting the stat that the world's data is roughly doubling every year. His guest today says that’s way too conservative, he’s seeing enterprise data soon growing at close to 10x per year. And most organizations are nowhere near ready for what that means. In this episode, Rohit Choudhary, founder and CEO of Acceldata, explains how the agentic data management platform his team has built helps enterprises make their increasingly vast amounts of data self-aware, self-optimizing, and AI-ready. He breaks down why governance needs to be operational and real-time rather than a one-time compliance exercise, and shares his view on why the most valuable professionals in the age of AI won’t be the best programmers, they’ll be the ones with the clearest thinking and the deepest domain expertise. Additional materials: www.superdatascience.com/979 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:26) How Rohit coined the term “data observability” (06:04) Agentic data management use cases (12:46) Why fixing data at the point of consumption is 1000x more expensive (30:49) Career paths and skills for the age of AI (42:38) Why enterprise data will soon grow at nearly 10x per year
A game millions of people solve over morning coffee is exposing a fundamental weakness in today’s most powerful AI models. In this Five-Minute Friday, Jon Krohn breaks down Pathway’s new Sudoku Extreme benchmark, roughly 250,000 of the hardest Sudoku puzzles available and why leading LLMs like o3-mini, DeepSeek-R1, and Claude 3.7 Sonnet scored effectively zero percent, while Pathway’s post-transformer BDH architecture achieved 97.4% accuracy at a fraction of the cost. Listen to the episode to find out what BDH is doing differently, why Sudoku performance matters far beyond puzzles, and what this means for the future of AI reasoning. Additional materials: www.superdatascience.com/978 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
What’s going to be the next big step function that blasts us forward in AI capabilities? To find out, Jon Krohn sits down with Professor Kyunghyun Cho, whose 200,000 citations and co-authorship of the first paper on attention place him among the most influential AI researchers in the world. In this episode, Kyunghyun explains why today’s models have already captured most correlations in passive data, making the real challenge about actively choosing which data to collect. He also weighs in on the open debate around world models, whether AI needs high-fidelity, step-by-step imagination or whether a high-level latent representation that lets it skip ahead is sufficient and shares the surprising discovery that 80% of his 200 computer science students had never installed a coding agent. Additional materials: www.superdatascience.com/977 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (06:43) The story behind the attention mechanism (28:43) Sample efficiency and active data collection (39:04) World models and latent planning (49:52) Teaching undergrads with coding agents (58:21) Reranking, multi-stage ranking, and the foundations of RAG
NVIDIA just dropped Nemotron 3 Super, a 120-billion-parameter open-weight model that only activates 12 billion parameters at a time and it’s built for the agentic AI era. In this Five-Minute Friday, Jon Krohn breaks down the model’s hybrid Mamba-Transformer architecture, its million-token context window, and why its combination of frontier-class reasoning with blazing-fast throughput matters for anyone building multi-agent systems. Find out how Nemotron 3 Super claimed the #1 spot on the DeepResearch Bench leaderboards, which companies are already adopting it, and where you can start using it today. Additional materials: www.superdatascience.com/976 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Zack Kass speaks to Jon Krohn about his bestselling, tech-positive book, The Next Renaissance, that charts the rapid progress of humanity and the benefits that artificial intelligence will bring to us, as well as why a future where intelligence is a cheap and abundant resource will give humanity an edge. Elsewhere in the show, Zack discusses why it’s important to hold parents, teachers and students accountable for their education, why it is incumbent on us to build a healthier relationship with technology, and his 4 principles for thriving in the age of AI. This episode is brought to you by the Cisco, by Acceldata and by ODSC, the Open Data Science Conference. Additional materials: www.superdatascience.com/975 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:14) About Zack Kass’ book, The Next Renaissance (20:18) The importance of literacy skills in the age of AI (28:01) AI in education (41:01) Principles for living in the era of AI
In this week’s Five-Minute Friday, Jon Krohn holds the AI bubble up to the light. He points to the deep greyzone found in AI startups like Cluely that are established on dubious ideas (Cluely’s tagline was “cheat on everything”) and funding bluster, as well as the staggering spending by companies on infrastructure and researcher salaries. Listen to the episode to hear about the historical precedents to the AI bubble that go all the way back to the invention of the railway, what to make of current investments in AI, and what you can do about these changes as an AI practitioner. Additional materials: www.superdatascience.com/974 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
No one should be manually writing code in 2026, thinks Chris Fregly, Jon Krohn’s guest on this week’s episode. In this interview about Chris’ latest book, AI Systems Performance Engineering, he explains why it’s so important to consider memory bandwidth when evaluating GPU performance, that understanding the full hardware software stack is the most valuable skill for anyone working in AI development, and which shortcuts we still shouldn’t ever take when writing code, even though we might be outsourcing a great deal to generative AI. This episode is brought to you by the Cisco, by Acceldata and by ODSC, the Open Data Science Conference. Additional materials: www.superdatascience.com/973 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:39) Why Chris wrote AI Systems Performance Engineering (21:39) Essential coding metrics (37:24) The importance of inference when coding (42:11) How to manage workflows while using AI agents (51:37) Where and how to invest in the AI market
Jon Krohn recaps the month of February in this episode of In Case You Missed It. Across four interviews with Will Falcon (Episode 965), Tom Griffiths (Episode 969), Antje Barth (Episode 963), and Praveen Murugesan (Episode 967), Jon questions the brains behind some of the AI industry’s most innovative companies about launching a startup, developing a popular product, what artificial intelligence can still learn from human intelligence, and how AI might finally start to think on its own. Additional materials: www.superdatascience.com/972 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Lin Qiao, CEO of Fireworks AI, talks to Jon Krohn about how she builds effective models quickly, why coding agents can perform at the level of a junior engineer, and what she attributes to the success of Fireworks AI: True to its name, the company exploded into the AI industry with over $300 million secured in venture capital, as well as netting a further $250 million Series C funding. For Lin, many enterprises miss out by not being familiar with open models. Open models give a lot of control to the user, offering customizability and at a much lower price point. Listen to hear how Fireworks AI helps companies continue to save money through AI. This episode is brought to you by the Dell, by Intel, by Cisco and by Acceldata. Additional materials: www.superdatascience.com/971 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (01:19) All about Fireworks AI (24:16) Why companies need to take notice of open models (33:05) The commercial viability of slow-reasoning models (38:51) Fireworks AI’s approach to model performance evaluations
Working with code-gen models and Claude Code: In this Five-Minute Friday, Jon Krohn addresses how AI superstars like Andrej Karpathy are using AI agents in their coding work, the outlook for code-gen in 2026, and how you can get started. Hear about Karpathy’s work as well as the soaring success of Peter Steinberger and how he managed to surpass the GitHub commit rate of teams as an individual working with AI agents. Additional materials: www.superdatascience.com/970 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Princeton Professor Tom Griffiths talks to Jon Krohn about his new book, The Laws of Thought, which grapples with the mathematical models behind biological and artificial intelligence, and what makes the human brain so fascinating for psychologists and computer scientists to study. In this episode, he details how the mathematical principles governing the external world can also be used to explore cognitive science, or “the internal world.” This episode is brought to you by the Dell, by Intel, by Cisco and by Acceldata. Additional materials: www.superdatascience.com/969 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (01:18) Tom Griffiths’ current research (21:23) On mathematical inference in LLMs (35:19) How to engineer inductive bias (52:00) How to model curiosity into AI systems
Now that AI agents can develop new apps from product development to delivery, do AI developers have reason to worry about their careers? Podcast host Jon Krohn addresses the stark predictions that AI could “eliminate half of all entry-level white-collar jobs” by going back to the data. Find out why the numbers show a very different picture, which in-demand occupations have increased by 40% since late 2022, and Jon’s advice on why technical professionals shouldn’t panic in this latest Five-Minute Friday. Additional materials: www.superdatascience.com/968 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
VP of Engineering at Samsara Praveen Murugesan talks to Jon Krohn about processing 20 trillion data points covering 90 billion miles across private and public sectors, how the company helps truckers who operate long hours and travel for long stretches without cellphone signal, and who they’re looking to hire to help this physical AI pioneer keep on developing high-impact solutions for real-world problems. And, if you’re looking to work for the company, there’s no better time to apply, and you’ll want to listen to the end of the show to hear exactly what Praveen looks for in new hires. This episode is brought to you by the Dell, by Intel, by Acceldata and by the ODSC, the Open Data Science Conference. Additional materials: www.superdatascience.com/967 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (01:01) The challenges of working with logistics data (16:43) Operating Edge AI in logistics and construction sectors (28:43) How quantum computing might redefine logistics (40:09) The real cost of swapping human heuristics for algorithmic planning (44:45) How to get a job at Samsara
Jon Krohn gives Five-Minute Friday listeners all the details about the new social network causing a stir, Moltbook. What makes Moltbook so unique is that this is the first network designed just for AI agents. It’s an exclusive club, only its alleged 1.5 million registered agents can post, comment, and upvote, but we can watch this real-world experiment in agent ecology from the sidelines. Listen to the episode to hear the fascinating, if disturbing, story of Moltbook’s swift turn into facilitating a digital theocracy and forms of government, and whether this development is a sign of an approaching singularity or rather AI continuing to ape human thought and turn it into slop. Additional materials: www.superdatascience.com/966 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
CEO of Lightning AI Will Falcon speaks to podcast host and Lightning AI fellow Jon Krohn about the company’s merger with Voltage Park, and why Will has named it the “full-stack AI neo-cloud for enterprises and frontier labs”. Lightning AI’s offer is a secure, flexible, and collaborative environment that can run on the cloud, all essentials for early-stage startups. Listen to the episode to hear Will Falcon discuss Lightning AI Studio, founding PyTorch Lightning, and how he came to found his AI company. This episode is brought to you by the Dell, by Intel, by Fabi and by Cisco. Additional materials: www.superdatascience.com/965 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:20) Lightning AI’s merger with Voltage Park (20:54) About neo-clouds (43:51) How Will founded Lightning AI (54:48) Current gaps in the AI in workplace
In this first of the year ICYMI episode, Jon Krohn selects his favorite moments from January’s SuperDataScience interviews. Listen to why incentivizing workers is the best way to get them to disclose their use of AI tools and pave the way for an AI-forward future, how AI continues to mimic human development in its own evolution, the importance of evaluation in building AI systems, and how to keep your best employees (and also: how to know your value) with guests Sadie St. Lawrence, Ashwin Rajeeva, Sinan Ozdemir, Vijoy Pandey, and Ethan Mollick. Additional materials: www.superdatascience.com/964 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Bestselling author and Gen AI instructor Antje Barth talks to Jon Krohn about her work at Amazon’s AGI Labs and their newest product Nova Act, as well as where we will see the most success with AI agents and how AI developers can reap those rewards. This episode is brought to you by the Dell, by Intel, by Fabi and by Cisco. Additional materials: www.superdatascience.com/963 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (01:23) Amazon’s latest product, Nova Act (11:05) How Nova Act tests reliability (24:01) Where Amazon’s 1000s of gen AI deployments succeed (31:32) How Nova Act maintains its security (36:32) The increasing value of agentic AI developers
Bestselling author of Co-Intelligence: Living and Working with AI Ethan Mollick speaks to Jon Krohn about just how much US firms have to gain from a willingness to adopt and experiment with AI, as well as the reality behind AI use among employees and the frontier models set to support them even further. Additional materials: www.superdatascience.com/962 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Dr. Vijoy Pandey returns to the show to talk to Jon Krohn about Cisco’s work to advance medicine and mitigate the impact of climate change with distributed artificial super-intelligence. Dr. Vijoy Pandey believes in a future where humans and AI agents work together to tackle our biggest challenges. For this to happen, we will need to have multi-agent systems and open-source platforms that let agents work together, avoiding the phenomenon of AI agents being “isolated geniuses” unable to collaborate. He elaborates on what Cisco is doing to close this gap. This episode is brought to you by the Dell, by Intel, by Fabi and by Scaylor. Additional materials: www.superdatascience.com/961 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:55) A definition of artificial super-intelligence (14:03) Distributed learning through Cisco’s Outshift (21:29) The semantic protocols for sharing intent in a distributed artificial super-intelligence framework (37:44) The cognitive memory fabric of the distributed artificial super-intelligence framework (46:24) Using cognitive engines as part of the distributed artificial super-intelligence framework
For 2026’s first episode of In Case You Missed It (ICYMI), Jon Krohn selects 6 clips from December for a wide-ranging look at the current state of AI in business and beyond. Hear from Joel Beasley (Episode 945), Jeff Li (Episode 947), Sandy Pentland (Episode 949), Josh Clemm (Episode 951), Penelope LaFeuille (Episode 952), and John Roese (Episode 953) on ensuring your AI systems get adopted and succeed in their goals, interesting ways to use AI for standup comedy routines, and more. Additional materials: www.superdatascience.com/960 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
AI entrepreneur and bestselling author Sinan Ozdemir speaks to Jon Krohn about the practical differences between agentic AI and AI workflows, why evaluating accuracy on its own won’t tell you enough about AI models, and more about his latest book Building Agentic AI. This episode is brought to you by the Dell, by Intel, by Fabi and by Cisco. Additional materials: www.superdatascience.com/959 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:57) Exploring the differences between workflows and agents (17:03) How to work out parameter count for a given task (25:26) The best way to evaluate LLMs (33:12) How to run hybrid workflow + agentic projects effectively
In this #sponsored Feature Friday episode, Salesforce’s Rahul Auradkar speaks to Jon Krohn about the company’s unified data engine and how its acquisition of Informatica provides the missing context layer for AI models and agents. Hear how Salesforce’s Data 360 helps customers to get accurate and insightful information about their business, and what AI models need to benefit a company’s bottom line (hint: it’s not only large amounts of data!) Additional materials: www.superdatascience.com/958 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
AI agents, data lakes, and managing data sprawl: Ashwin Rajeeva, cofounder and CTO of Acceldata, speaks to Jon Krohn about how the agentic data management startup raised over $100 million in venture capital to expand its business in automating data quality assurance as well as cataloguing and pipeline maintenance across enterprise environments. Acceldata utilizes multiple agents to solve enterprise-grade questions with company data. It also uses autonomous data pipelines that can detect and fix issues without human intervention, and the platform’s agentic data management system ADM also lets humans stay in the loop wherever needed. This episode is brought to you by the Dell, by Intel, by Fabi and by Cisco. Additional materials: www.superdatascience.com/957 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:25) About Acceldata and xLake (15:03) Autonomous data pipelines (21:02) How and when to keep humans in the AI loop (27:43) How Acceldata solves ‘data sprawl’ (31:53) Habits of successful tech leaders
#Sponsored SVP, Head of Product for Salesforce’s AppExchange & Ecosystem, Tyler Carlson, talks to Jon Krohn about taking AI agents from prototype to enterprise-grade production with the Agentforce 360 Platform. Though we may now have plenty of tools to build demos for AI agents, most teams still struggle to turn early prototypes into secure and scalable products. With Salesforce’s Agentforce 360 Platform, users can build customer-focused agentic applications with a multi-LLM planner service for reasoning and logic, as well as a new scripting language for deterministic control over how agents interact with the contextual layer of the Salesforce data model. Learn how Salesforce’s WYSIWYG schema builder helps customers build fully functional applications with low- and pro-code capabilities, the common mistakes that innovators make when moving from prototype to production, and whether AI agents might replace non-agentic AI applications in this Feature Friday. Additional materials: www.superdatascience.com/956 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Sadie St Lawrence joins Jon Krohn to discuss what to expect from the AI industry in 2026. Sadie and Jon talk through what they think will be the five biggest trends in AI, hand out awards for the best moments, comebacks, and disappointments in AI in 2025, and review how their predictions for 2025 played out. Hear Sadie’s five exciting predictions for 2026, from emerging jobs in AI to an important return to the drawing board! This episode is brought to you by the Dell, by Intel, by Fabi and by MongoDB. Additional materials: www.superdatascience.com/955 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (11:36) Recapping Sadie and Jon’s predictions for 2025 (26:54) The SuperDataScience Awards in AI (49:05) Prediction #1 for AI in 2026 (52:13) Prediction #2 for AI in 2026 (53:33) Prediction #3 for AI in 2026 (57:54) Prediction #4 for AI in 2026 (1:01:01) Prediction #5 for AI in 2026
Jon Krohn wraps up 2025 with his thoughts on how agentic AI has become as much a resounding success as an annoying buzzword for many in the tech industry, why such promising developments in generative AI mean that well-prepared, secured data will be ever more crucial, and Jon’s hopes for a better year for everyone across the world in 2026. Additional materials: www.superdatascience.com/954 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Dell Technologies’ John Roese talks to Jon Krohn about the phenomenon of “agent-washing”, his contribution to Dell’s incredible revenue boost in 2025, and why “knowledge layers” will be crucial to future tech. Hear also John’s predictions for where AI is going to lead us in 2026, from better, clearer governance, data management methods and definitions for agentic AI, to systems that keep AI tools and our data running and secure with the help of “AI factories” and “sovereign AI”. This episode is brought to you by MongoDB and by Y Carrot. Additional materials: www.superdatascience.com/953 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (01:31) John Roese’s predictions for AI in 2026 (11:28) How John increased ROI at Dell Technologies (18:59) John’s predictions for AI in 2026 (40:13) How Dell’s clients are using AI factories
“The Fit Data Scientist” newsletter author Pénélope Lafeuille talks to Jon Krohn about how to give your all at work, offering her top tips for a healthy body and a healthy mind. Learn why “The SuperDataScience Podcast” made it onto her top 3 data science podcasts, and why following your passion can pay off in dividends for your career. Additional materials: www.superdatascience.com/952 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
VP of Engineering at Dropbox Josh Clemm speaks to Jon Krohn about consolidating search tools across apps with the AI-powered workspace, Dropbox Dash, the new collaborative AI systems that enhance interoperability between team members and their projects, and how to avoid “context rot”. Dropbox Dash gives users the best of Dropbox’s cloud storage and search functions, plus a “universal search” ability to locate information across multimedia and apps. “AI really needs to understand you and your team, first and foremost, and all that connected data,” says Josh. This episode is brought to you by the Dell, by Intel, by Airia and by MongoDB. Additional materials: www.superdatascience.com/951 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (01:07) All about Dropbox Dash (10:00) The benefits of browser-embedded AI (22:17) Why context engineering is so critical to agentic systems (37:51) How creating apps helps tech leadership (48:39) When to decide to use data versus intuition
In this special holiday episode, the SuperDataScience Podcast team comes together to wish you happy holidays and thank you for listening throughout the year. Team members from around the world share warm greetings in their own voices and languages as we reflect on another year of learning, curiosity, and community. From all of us at SDS, we wish you a joyful holiday season and look forward to bringing you more data science, machine learning, and AI content in the year ahead. Additional materials: www.superdatascience.com/950 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Alex “Sandy” Pentland, Toshiba Professor of Media Arts & Science at MIT and Fellow at Stanford, speaks to Jon Krohn about his new book, Shared Wisdom, why he attributes AI to the collapse of the Soviet Union, and why those risks to society could still be relevant today. We can only achieve better system performance, Alex says, when we build tools that keep step with the way that people make decisions. Listen to the episode to hear Alex talk about how he is helping make AI agents work for individuals rather than the companies that develop them, and his work in making sure that systems operate consistently and fairly across the world. This episode is brought to you by the Dell, by Intel, by Fabi, and by Airia. Additional materials: www.superdatascience.com/949 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:19) About Alex Pentland’s new book, Shared Wisdom (16:00) About loyalagents.org (28:36) Why we need data unions (34:02) The governance of AI (41:24) How to measure the social impact of AI projects
In this November episode of “In Case You Missed It” series, Jon Krohn selects his favorite clips from the month. Hear from Shirish Gupta and Tyler Cox (Episode 939), Vikoy Pandey (Episode 941), Marc Dupuis (Episode 937), and Maya Ackerman (Episode 943) on getting back to human motivation and the importance of evaluating the tools and data we use. Additional materials: www.superdatascience.com/948 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jeff Li tells Jon Krohn what it's like to work at scale as a data scientist and a machine learning engineer at Netflix, Spotify and DoorDash, as well as how to get a foot in the door at these companies. Jeff also discusses how to run forecasts and trends, and how to read their results. Listen to hear Jeff Li discuss how Spotify became a podcast powerhouse, his startup move.ai, and the tools he uses every day. This episode is brought to you by the Dell, by Intel, by Fabi, and by Airia. Additional materials: www.superdatascience.com/947 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (09:05) Forecasting in data science (23:33) How to get a data science job at Netflix (30:06) Jeff’s experience on launching an AI startup (51:57) Jeff’s AI toolkit
Jon Krohn looks into the benefits of robotaxis, from safety to affordability, in this Five-Minute Friday. Hear about Waymo’s partnership with Jaguar Land Rover, the latest safety studies concerning driverless vehicles, and a case for robotaxis becoming the preferred method of transport in the US, where households spend roughly 15% of their budget on vehicle ownership. Additional materials: www.superdatascience.com/946 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Is there humor in data? Joel Beasley, host of Modern CTO, tells Jon Krohn how he used AI to turn his sights to stand-up comedy. He also shares his tips on tech leadership that he learned from his popular podcast, Modern CTO, and how he is using generative AI as a collaborative partner in his creative work. This episode is brought to you by the Dell, by Intel, by Fabi, and by Gurobi. Additional materials: www.superdatascience.com/945 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:14) Joel Beasley on his comedy career (19:04) Applying the ‘memory palace’ technique (22:28) About The Modern CTO Podcast (36:24) Leadership advice from The Modern CTO
Google is steaming ahead with launching its top-league new Gemini 3 Pro model across their product suite, from Google Search to Vertex AI cloud services. The multinational tech company is also letting eager early adopters like Wayfair and GitHub. Get all the detailed data, its performance across hard-to-game industry benchmarks, and what this all means for the way you use generative AI, in this week’s episode. Additional materials: www.superdatascience.com/944 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Creative human-AI partnerships and AI-generated music: WaveAI CEO and co-founder Maya Ackerman speaks with Jon Krohn about learning to see – and accept – AI’s potential as a creative partner in a human-centric, AI-forward future. Listen to the episode to hear Maya Ackerman discuss reframing hallucination as a creative force, her work at WaveAI, and how to push the boundaries of creativity using generative AI. This episode is brought to you by the Dell, by Intel, by Gurobi and by Airia. Additional materials: www.superdatascience.com/943 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (05:20) Maya’s challenge to anthropocentrism (19:26) How to compose music with AI (28:13) How to invest in creative empowerment (32:18) How to produce genuinely creative artworks through AI (44:58) The future of GenAI
What’s on the horizon for AI? Jon Krohn wades through opinions from more than experts, curated by the Longitudinal Expert AI Panel (LEAP), about what we can expect from the industry. From estimates on AI-assisted workers through energy consumption to AI performance in highly skilled domains, find out just how much LEAP thinkers believe AI is permeating our daily work and life in this Five-Minute Friday. Additional materials: www.superdatascience.com/942 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Vijoy Pandey imagines a bold new society in which agents and humans make scientific discoveries and complete physical tasks together, and he tells Jon Krohn about his work at AGNTCY, Cisco’s open-source platform for the Internet of Agents. Listen to the episode to hear Vijoy Pandey talk about how a future society in which multi-agents and humans interact may be a real possibility, what TCP/IP is, how to find trustworthy AI agents, and how to get your hands on AGNTCY today! This episode is brought to you by the Dell, by Intel, by Fabi and by Gurobi. Additional materials: www.superdatascience.com/941 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:37) All about AGNTCY (12:04) How an agent-human society might function (15:19) What an “Internet of Agents” means (27:17) The future of access management (41:39) How to trust AI agents (48:49) How to get started with AGNTCY
Jon Krohn curates a selection of clips from the month that was. Hear from the orchestrators of an expanding AI universe in this episode of In Case You Missed It, with news, views and groundbreaking ideas from Sheamus McGovern, Jerry Yurchisin, Stephanie Hare, Larissa Schneider, and Adrian Kosowsky. We cover baby dragons, the Hippocratic Oath, and, of course, all the latest in artificial intelligence! Additional materials: www.superdatascience.com/940 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
State space models (SSMs), granite models, and Mamba: Dell’s Tyler Cox and Shirish Gupta discuss with Jon Krohn why state space models can process information so efficiently, and how Dell’s AI factory helps enterprises manage custom AI workloads. Hear the latest on the Dell Pro AI Studio and Dell’s partnerships with IBM and Hugging Face in this episode. This episode is brought to you by the Trainium2, the latest AI chip from AWS and by Gurobi. Additional materials: www.superdatascience.com/939 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:58) Dell Pro AI Studio news (23:17) How Dell manages interoperability (28:08) About the Dell/IBM granite models (47:38) How to troubleshoot AI tools (52:36) How Dell performs against benchmarks
Jon Krohn speaks to Rohan Kodialam, Cofounder and CEO of Sphinx, the company that redefines how machine intelligence reasons data with frontier AI. In this Feature Friday, Jon and Rohan discuss the benefits of using Sphinx to assist with data analysis. Get under the hood to learn how Sphinx operates, from running commands to ensuring your data stays secure, and find out how you can get your hands on this great tool for free. Additional materials: www.superdatascience.com/938 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
AI tools won’t eliminate but elevate data scientists, says Marc Dupuis. The CEO of fabi.ai talks to Jon Krohn about the new wave of AI-driven platforms that integrate workflows within popular work tools like Slack and email, and how building AI-first products means widening access to all ability levels. This episode is brought to you by the Gurobi, by Dell and by Intel. Additional materials: www.superdatascience.com/937 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (09:31) Will fabi.ai outshine data science practitioners (20:40) Resolving workflows with fabi.ai (24:08) Creating AI agents with fabi.ai (45:23) How to avoid ‘gaming’ targets
How much power – and risk – do we carry around with us in our pockets? A Reuters investigation about how easily LLMs can be utilized for online phishing scams is the subject of this week’s Five-Minute Friday with Jon Krohn. By asking six of the most popular LLMs (Grok, ChatGPT, Meta AI, Claude, DeepSeek and Gemini) to generate phishing emails specifically targeting elderly people, Reuters found the safety sometimes severely lacking in the models. Listen to the episode to hear Jon quantify this problem with real-world examples, why mere content warnings in LLM models don’t work, and the troubling results of the phishing requests. Additional materials: www.superdatascience.com/936 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jon Krohn speaks to researcher, broadcaster and author Stephanie Hare about how the Hippocratic Oath might apply to artificial intelligence, and a guiding ethos for pushing innovation while protecting users from harm. A code of conduct, she says, could be one approach to ensuring that people are using technology more mindfully and ethically, as well as an opportunity for users to feel that they belong to a wider, global community. Although she sympathizes with people concerned by overregulation undermining innovation, Stephanie also notes that we expect certain standards to be met elsewhere, such as vehicle and drug safety, as well as fair journalistic practices. As Stephanie explains, we need to find a realistic middle ground between innovation and regulation. This episode is brought to you by the Dell, by Intel, by Fabi and by Gurobi. Additional materials: www.superdatascience.com/935 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (01:23) What ‘technology ethics’ is (14:46) Developing a Hippocratic Oath for tech (42:32) How to protect against sensationalism (53:38) How to maintain a balance of growth and infrastructure
With the number of jobs dramatically slowing in the last year, many question if this decline is down to companies turning to AI for completing entry-level tasks in particular. Research published earlier this month by Yale University shows no major difference in the types of roles and tasks in so-called `white-collar jobs` since late 2022, an auspicious date that coincides with the launch of ChatGPT. In this week‘s Five-Minute Friday, host Jon Krohn discusses if and when AI will undercut junior-level jobs, particularly in the US. Additional materials: www.superdatascience.com/934 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Sheamus McGovern, CEO of Open Data Science, takes Jon Krohn and his listeners on a journey to launching his popular data science and AI conference, now in its tenth year, as well as the great shifts to the fields that he has seen on the way. For Seamus, the growth of his Open Data Science Conference has shown him that an AI engineer is just the beginning of several roles that will emerge from the industry. He asks Jon to consider the breadth of tasks demanded of today’s engineers, from data profiling and transformation to feature engineering, hyper-parameter tuning, and model deployments. Just as the AI engineer emerged from the data scientist role, Seamus expects the industry to respond to the broadening range of projects and tools with new, niche, and dynamic job roles. This episode is brought to you by the Trainium2, the latest AI chip from AWS, by Gurobi, by Dell and by Intel. Additional materials: www.superdatascience.com/933 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:50) Why Seamus started ODSC (18:27) The differences in AI engineers and data scientists (24:20) How to keep up with AI’s rapid pace (33:51) How people hire for AI orchestration (46:26) How companies can get team skillsets right
Larissa Schneider speaks to Jon Krohn in this Feature Friday about finding the right time to invest in AI solutions, and when it’s better to build them yourself. She discusses her work leading global strategy and operations at Unframe, and how they raised $50 million in venture capital since the company’s launch in March 2025. Additional materials: www.superdatascience.com/932 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
AI predictions, and how to act on them: Data Science Strategist at Gurobi, Jerry Yurchisin, speaks to Jon Krohn about how mathematical optimization helps enterprises automate decisions for business success and where to find the resources to make it happen. This episode is brought to you by the ODSC, the Open Data Science Conference, by Fabi, by Dell, and by Intel. Additional materials: www.superdatascience.com/931 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:34) What mathematical optimization is (13:58) How to get started with mathematical optimization (45:56) Gurobi’s use cases (56:29) Quantum computing and mathematical optimization
Jon Krohn’s highlights from this month of interviews focus on ways to future-proof your career, looking at the hardware that will get you the most mileage, the emerging roles that are well worth a look, and the developments in AI that will endure in a field constantly testing the durability of its own breakthroughs. Additional materials: www.superdatascience.com/930 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Breaking news: Jon Krohn welcomes Adrian Kosowski to the show to talk about the groundbreaking research happening at Pathway. Adrian and his team demonstrate how they have brought attention in AI closer to the way the brain functions, creating, in essence, a “massively parallel system of [artificial] neurons” that communicate with one another and exhibit properties similar to natural neurons. The goal is to move beyond the current limitations of transformers, where reasoning can be generalized across more complex and extended reasoning patterns, approximating a more human-like approach to problem-solving. This episode is brought to you by the Trainium2, the latest AI chip from AWS, by Dell, by Intel, by and Gurobi. Additional materials: www.superdatascience.com/929 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (01:27) Pathway’s ground-breaking new biologically inspired architecture (20:40) Limitless context windows (34:39) BDH architecture as positive space (53:11) Building multilingual models (1:01:07) How to access the BDH architecture
Prompt injections, malicious code, and AI agents: In this week’s Five-Minute Friday, Jon Krohn looks into the current security weaknesses found in AI systems. A structural vulnerability that The Economist dubs a “lethal trifecta” could cause havoc for AI users, unless we take the necessary steps to contain our systems. Additional materials: www.superdatascience.com/928 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Earlier this year, David Loker joined CodeRabbit as their Director of AI. As more people come to write code with the help of large language models, David believes CodeRabbit will become a helpful assistant for code reviewing and pull requests. He tells Jon Krohn how CodeRabbit assists developers with real-time feedback, as well as the reality of vibe coding, the optimization challenges of agentic AI, and other pressing questions in AI and tech. This episode is brought to you by the Dell, by Intel, by Gurobi and by ODSC, the Open Data Science Conference. Additional materials: www.superdatascience.com/927 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (01:26) How CodeRabbit helps with coding (17:30) Context engineering in context (40:40) How CodeRabbit keeps data secure (46:10) David’s thoughts on “vibe coding” (1:03:04) If machines will ever be truly creative
In this Five-Minute Friday, Jon Krohn explores how AI is reshaping the legal industry. He investigates how AI tools are helping lawyers make conclusions faster, how paralegals are being retrained, and the latest in-demand role in law (hint: It concerns AI). Listen to hear how Harvey AI and Thomson Reuters’ CoCounsel are using AI to help lawyers get ahead. Additional materials: www.superdatascience.com/926 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Tech innovation’s dependence on economic systems, trust in technology throughout history, and job displacement through AI: The Dieter Schwartz Associate Professor of AI and work at the University of Oxford, Carl Benedikt Frey, talks to Jon Krohn about his latest book, How Progress Ends, as well as how different economic systems deal with innovation and scaling, dealing with the homogeneity of generative AI output, and how to stay afloat in the new wave of job automation. This episode is brought to you by the Dell, by Intel, by ODSC, the Open Data Science Conference and by Gurobi. Additional materials: www.superdatascience.com/925 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:00) All about How Progress Ends: Technology, Innovation, and the Fate of Nations (14:26) The role of weak ties in driving technological innovation (18:22) How to keep innovating as a big business (48:05) What we can learn and apply from previous industrial revolutions (54:33) How workers can try to ‘future-proof’ themselves
MIT lab NANDA (“Networked AI Agents in Decentralized Architecture”) reveals less than promising results for the future of AI adoption in businesses. According to “The GenAI Divide: State of AI in Business 2025”, a whopping 95% of enterprise AI projects “are getting zero return” on their $30-40 billion investment. Jon Krohn takes this Five-Minute Friday to look into why this has happened, with help from a critical response to the report written by Futuriom’s R. Scott Raynovich. Additional materials: www.superdatascience.com/924 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Graphs, but not as you would expect them: Graph analytics guru Amy Hodler speaks to Jon Krohn about the graph data structure and graph applications, graph algorithms, graph RAG, and graphs as memory systems for AI agents. We can use graphs in a surprising number of ways. Money laundering and fraud, as well as supply-chain crime, leave breadcrumbs at multiple “touch-points” over time, behaviors that graphs are better suited to reveal than rows and tables. Amy sees that most interest in graphs has been in the cybersecurity space. But this work isn’t only restricted to fighting crime! Listen to the episode to hear more case examples and how to get into graph work. This episode is brought to you by the Dell, by the Intel, by ODSC, the Open Data Science Conference and by Gurobi. Additional materials: www.superdatascience.com/923 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: 01:49) A brief history of graphs (10:08) Uncovering fraud with graphs (28:31) Where graphs are most commonly applied, to date (34:49) Retrieval augmented generation graphs (48:04) The future of graphs
Hugo Dozois-Caouette speaks to Jon Krohn about his startup MaintainX and how he secured $254 million in venture capital, reaching a $2.5 billion valuation. MaintainX builds computerized maintenance management systems (CMMS) and enterprise asset management (EAM) software for industrial and manufacturing companies. This "digital clipboard" delivered through web and mobile apps connects machines, work orders, and frontline teams to boost productivity, reduce downtime, and prevent costly breakdowns. The platform captures knowledge from experienced workers and delivers AI-powered insights, with features like MaintainX CoPilot helping teams troubleshoot issues and make faster decisions. Listen to the episode to hear Hugo's perspective on manufacturing gaps that technology can fill, MaintainX's tech stack, and how CMMS platforms address information disconnects that slow down frontline teams. Additional materials: www.superdatascience.com/922 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Using Windows for AI development and the bleeding edge of NPUs: Shirish Gupta and Ish Shah from Dell Technologies speak to Jon Krohn about the latest products from Dell, the future of neural-processing units (NPUs), and how AI developers can make sound hardware investments. This episode is brought to you by the Trainium2, the latest AI chip from AWS, by ODSC, the Open Data Science Conference and by Gurobi. Additional materials: www.superdatascience.com/921 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:18) Why Windows still outranks other operating systems (20:58) The difference between GPUs and NPUs (32:44) How to access and use Dell’s NPUs and GPUs (49:08) Using processing units on the cloud versus locally (57:43) About the Dell Pro Max
This month’s episode of In Case You Missed It gives us reasons to be cautiously optimistic about the future of large language models (LLMs), with guests discussing what to do about recent reports that found AI agents blackmailed human users when threatened, the importance of post-training LLMs, and the training we have available for data and AI engineers to create robust, secure, and useful AI. Jon Krohn includes clips from his interviews with Akshay Agrawal (Episode 911), Julien Launay (Episode 913), Michelle Yi (Episode 915), and Kirill Eremenko (Episode 917). Additional materials: www.superdatascience.com/920 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
PyTorch, AGI, and the future of alignment research: Aurélien Géron joins Jon Krohn in this live interview to talk about the fourth edition of his bestselling Hands-On Machine Learning as well as what superintelligence makes him hopeful for, as well as what concerns him about machines surpassing human intelligence. This episode is brought to you by Gurobi and by the Dell AI Factory with NVIDIA Additional materials: www.superdatascience.com/919 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:04) Why Aurélien wrote Hands-On Machine Learning (20:54) How Aurélien came to decide on material for the new edition (28:53) Aurélien’s predictions for AGI (51:21) How to support alignment research (1:13:42) Does superintelligence mean super-capability
In this Five-Minute Friday, Jon Krohn introduces listeners to CrewAI, an open-source Python framework that can create and manage multi-agent teams. The clue is in the title: CrewAI assembles specialized agents into single “crews” that achieve complex goals between them. CrewAI’s agent teams can also learn and iterate, meaning that after the crew has achieved its goals for the first time, they can refine and tailor their approach to future goals. Additional materials: www.superdatascience.com/918 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Founder of SuperDataScience, Kirill Eremenko, talks to Jon Krohn about how he found the best tools and approaches to help launch his 8-week AI engineering bootcamp. He breaks down the topics participants cover each week, and he also shares his tips with listeners who might want to start their own tech bootcamp or sign up for SuperDataScience’s September 2025 cohort. This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference Additional materials: www.superdatascience.com/917 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (10:58) Weeks 1-4 of the SuperDataScience bootcamp (37:52) How to use AI to drive the bottom line in business (47:50) Weeks 5-8 of the SuperDataScience bootcamp (54:50) How to convert LLMs to agents (1:09:33) Jon’s feedback on the SuperDataSciencebootcamp
GPT-5 has just been released, but with not very much fanfare. In this Five-Minute Friday, Jon Krohn asks if GPT-5 deserves the community’s underwhelmed response to its release. He outlines five features of the model and explains why people might be feeling less than enthusiastic in the broader context of LLM development. Which LLMs are leading the way, and which are still playing the game of catch-up? Additional materials: www.superdatascience.com/916 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Tech leader, investor, and Generationship cofounder Michelle Yi talks to Jon Krohn about finding ways to trust and secure AI systems, the methods that hackers use to jailbreak code, and what users can do to build their own trustworthy AI systems. Learn all about “red teaming” and how tech teams can handle other key technical terms like data poisoning, prompt stealing, jailbreaking and slop squatting. This episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI Factory with NVIDIA. Additional materials: www.superdatascience.com/915 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:31) What “trustworthy AI” means (31:15) How to build trustworthy AI systems (46:55) About Michelle’s “sorry bench” (48:13) How LLMs help construct causal graphs (51:45) About Generationship
In this Five-Minute Friday, Cofounder and CTO of lakeFS Oz Katz talks to Jon Krohn about data warehouses, data lakes, and how companies can handle increasingly complex data infrastructures and formats. Hear about lakeFS’s collaboration with Legofest, lakeFS’s approach to helping users collaborate on data lakes, and how to overcome the challenges of working with multimodal data. Additional materials: www.superdatascience.com/914 This episode is brought to you by the Dell AI Factory with NVIDIA.
Julien Launay launched Adaptive to give data science teams in business enterprises their “RLOps tooling” to make reinforcement learning easier. Talking to Jon Krohn, Julien says, “Most of our users are data scientists who write Python codes to interface with the system”. Adaptive is also able to work with companies without data science teams, collaborating with partners like Deloitte to add the necessary personnel. Julien is currently working on making his platform more widely available. Additional materials: www.superdatascience.com/913 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
In this episode of In Case You Missed It, we look back on five great interview episodes from July. Hear from Lilith Bat-Leah (Episode 901), Sinan Ozdemir (Episode 903), Sebastian Gehrmann (Episode 905), Zohar Bronfman (Episode 907) and Robert Ness (Episode 909). They’ll tell you why data-centric machine learning is so important across disciplines, starting with law, and how we can use AI benchmarks and “red teaming” to refine our search for the best AI models. Additional materials: www.superdatascience.com/912 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Reproducibility, Python notebooks, and data science communities: Software developer Akshay Agrawal speaks to Jon Krohn about Marimo, the next-generation computational notebook for Python, how he built and fostered a thriving community around the product, and what makes this notebook so versatile and accessible for users. Additional materials: www.superdatascience.com/911 This episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
In this Five-Minute Friday, Jon Krohn looks into AI’s disruption of the journalism industry and how it has fundamentally reshaped news production. Multiple news outlets’ suing of ChatGPT over its use of copyrighted materials may have taken the most headlines to date, but this isn’t to say news media is rebuffing AI entirely. On the contrary, several outlets have launched summarization and analysis tools for both internal and external use, such as The New York Times’s Echo and The Washington Post’s Haystacker. This episode looks into the ways major news outlets are utilising AI, and what this means for journalists. Additional materials: www.superdatascience.com/910 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Researcher at Microsoft Robert Usazuwa Ness talks to Jon Krohn about how to achieve causality in AI with correlation-based learning, the right libraries, and handling statistical inference. When dealing with causal AI, Robert notes how important it is to keep aware of variables in the data that may mislead us and force inaccurate assumptions. Not all variables will be useful. It is essential, then, that any assumptions are grounded in a deeper understanding of how the data were gathered, and not what appears in the dataset. Listen to the episode to hear how you can apply causal AI to your projects. Additional materials: www.superdatascience.com/907 This episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
The moral and ethical implications of letting AI take the wheel in business, as revealed by Anthropic: Jon Krohn looks into Anthropic’s latest research on how to use and deploy LLMs safely, specifically in business environments. The team designed scenarios to test the behavior of AI agents when given a goal and a set of obstacles to reach it. Those obstacles included 1) threats to the AI’s continued operation, and 2) conflict between the AI’s goals and the goals of the company. Hear Jon break down the results of this research in this Five-Minute Friday. Additional materials: www.superdatascience.com/908 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
“Intelligence has many forms,” says Zohar Bronfman, who speaks with Jon Krohn about the fascinating intersection between computational neuroscience and philosophy, and how it has brought him closer to understanding what is necessary to develop human-like intelligence in machines, as well as his motivations for launching Pecan AI and why predictive models outstrip generative models in business. Additional materials: www.superdatascience.com/907 This episode is brought to you by, Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:47) Why LLMs aren’t bringing us closer to AGI (33:44) About Pecan AI (51:03) Why data modeling is so challenging (1:01:25) How Pecan AI makes its tools widely accessible
Jason Corso speaks to Jon Krohn in this Five-Minute Friday all about Voxel51’s latest tool, Verified Auto-Labelling, and the company’s incredible success in developing popular tools for computer vision. Additional materials: www.superdatascience.com/906 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
RAG LLMs are not safer: Sebastian Gehrmann speaks to Jon Krohn about his latest research into how retrieval-augmented generation (RAG) actually makes LLMs less safe, the three ‘H’s for gauging the effectivity and value of a RAG, and the custom guardrails and procedures we need to use to ensure our RAG is fit-for-purpose and secure. This is a great episode for anyone who wants to know how to work with RAG in the context of LLMs, as you’ll hear how to select the best model for purpose, useful approaches and taxonomies to keep your projects secure, and which models he finds safest when RAG is applied. Additional materials: www.superdatascience.com/905 This episode is brought to you by, Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:28) Findings from the paper “RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models” (09:35) What attack surfaces are in the context of AI (38:51) Small versus large models with RAG (46:27) How to select an LLM with safety in mind
In this Five-Minute Friday, Jon Krohn reveals how AI is taking on the glitzy world of advertising. Bold claims from Meta and OpenAI contend that users will soon be able to plug in what they want and have AI churn out an ad campaign for little to no cost are shaking the advertising industry to its core. The fact that the four biggest sellers of ads (Google, Meta, Amazon, and ByteDance) are digital companies and accounted for over half of the global market in 2024 adds salt to the wound. Hear the three ways that AI is disrupting the industry, and who (or what) has the most influence on digital consumers to date. Additional materials: www.superdatascience.com/904 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Has AI benchmarking reached its limit, and what do we have to fill this gap? Sinan Ozdemir speaks to Jon Krohn about the lack of transparency in training data and the necessity of human-led quality assurance to detect AI hallucinations, when and why to be skeptical of AI benchmarks, and the future of benchmarking agentic and multimodal models. Additional materials: www.superdatascience.com/903 This episode is brought to you by Trainium2, the latest AI chip from AWS, by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (16:48) Sinan’s new podcast, Practically Intelligent (21:54) What to know about the limits of AI benchmarking (53:22) Alternatives to AI benchmarks (1:01:23) The difficulties in getting a model to recognize its mistakes
In this episode of “In Case You Missed It”, Jon recaps his June interviews on The SuperDataScience Podcast. Hear from Diane Hare, Avery Smith, Kirill Eremenko, and Shaun Johnson as they talk about the best portfolios for AI practitioners, how to stand out in a saturated candidate market for AI roles, how to tell when an AI startup is going places, and ways to lead AI change in business. Additional materials: www.superdatascience.com/902 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Senior Director of AI Labs for Epiq Lilith Bat-Leah speaks to Jon Krohn about the ways AI have disrupted the legal industry using LLMs and retrieval-augmented generation (RAG), as well as how the data-centric machine learning research movement (DMLR) is systematically improving data quality, and why that is so important. Additional materials: www.superdatascience.com/901 This episode is brought to you by the Dell AI Factory with NVIDIA and Adverity, the conversational analytics platform. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (05:45) Deciphering legal tech terms (TAR, e-discovery) (13:47) How legal firms use data and AI (29:01) All about data-centric machine learning research (DMLR) (46:58) Lilith’s career journey in the AI industry
“Stay happy and healthy”: In this special Five-Minute Friday, Jon Krohn speaks with Annie, his grandmother, on her 95th birthday. Hear how she is physically and mentally coping with illnesses that limit her mobility and the joys of having a pet. Additional materials: www.superdatascience.com/900 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Data science skills, a data science bootcamp, and why Python and SQL still reign supreme: In this episode, Kirill Eremenko returns to the podcast to speak to Jon Krohn about SuperDataScience subscriber success stories, where to focus in a field that is evolving incredibly quickly, and why in-person working and networking might give you the edge over other candidates in landing a top AI role. Additional materials: www.superdatascience.com/899 This episode is brought to you by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:35) Stories from five SuperDataScience subscribers (27:32) How to secure a career in a fast-paced industry (44:19) How to stand out against huge competition in data science (1:01:40) The importance of communication in data science (1:16:41) Where to focus your skills in AI engineering
In this Five-Minute Friday, Jon Krohn announces his new, free workshop on Agentic AI. On this four-hour comprehensive course, you’ll learn the key terminology for working with these flexible, multi-agent systems and then get to grips with developing and deploying this artificial “team of experts” for all your AI-driven projects. Additional materials: www.superdatascience.com/898 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Diane Hare talks to Jon Krohn about the power of storytelling for corporate buy-in of AI initiatives, how to actively implement AI to transform organizations, and how emerging professionals can upskill themselves. Hear how she discovered her background in storytelling at Ernst & Young and her work with Simon Sinek, which she finds to be integral to her process. Inspired by Sinek’s aphorism “start with why”, Diane notes that many companies neglect this crucial part of their mission because they never take the time to work on it. Additional materials: www.superdatascience.com/897 This episode is brought to you by Trainium2, the latest AI chip from AWS, by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:51) How Y Carrot works with BizLove (14:19) How BizLove prioritizes change management (29:18) How to upskill effectively (42:37) How BizLove integrated data from two enterprises (48:52) How to enable change in your business
The Economist reported that global Google searches for "AI unemployment" hit an all-time high earlier this year. But do we have to worry about AI taking our jobs? In this week’s Five-Minute Friday, Jon Krohn investigates whether the rise of AI has directly led to an increase in unemployment. Additional materials: www.superdatascience.com/896 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
How to get funded by a VC specializing in AI: Head of AIX Ventures Shaun Johnson talks to Jon Krohn about investment strategies, how to simplify AI adoption, why a little competition can be so beneficial to AI startups, and how Big Tech is circumventing anti-monopoly measures. Additional materials: www.superdatascience.com/895 This episode is brought to you by the Dell AI Factory with NVIDIA and by Adverity, the conversational analytics platform. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (10:36) What Shaun looks for when evaluating early-stage AI startups (19:11) Building out AI startups (41:44) How AI practitioners can future-proof their careers (45:27) How to measure AI impact (53:30) The key verticals ripe for AI disruption
In this episode of “In Case You Missed It”, Jon Krohn takes clips from interviews with guests in May 2025. From AI agent integration and RAG-based chatbots to education through virtual reality headsets and data harmonization, this episode explores how industry leaders are developing the tools and technologies that can improve operations, education, healthcare, and marketing. Highlight clips are with John Roese, Global Chief Technology Officer and Chief AI Officer at Dell Technologies (Episode 887), Senior Developer Relations Engineer at Posit, PBC Jeroen Janssens and Lead Data Scientist at Xomnia Thijs Nieuwdorp (Episode 885), Founder of CEEK Mary Spio (Episode 889), and Martin Brunthaler, Co-founder and Chief Technology Officer at Adverity (Episode 891). Additional materials: www.superdatascience.com/894 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Avery Smith is a passionate and motivational YouTuber and careers educator for data science. In this episode, Jon Krohn asks Avery about the tools and tricks he has learned from personal experience and from his students in how to get ahead in the tech industry. Avery shares the “learning ladder” he uses to help newcomers start on the right foot with great examples from former bootcamp students who have put his theories into practice. And, if you’re using LinkedIn to find jobs, Avery explains why this might be one of the reasons you’re not getting work. Additional materials: www.superdatascience.com/893 This episode is brought to you by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:02) How to take the jump into a data science career (15:19) Avery’s recommended strategy for starting a career in data science (18:10) Recommendations for people learning data science with LLMs (32:52) What should go into a data science portfolio (46:07) Why Avery prefers practice over theory in teaching data science (48:25) The bare minimum to get your first job in data science
Businesses have entered a “trough of disillusionment” for AI. In this Five-Minute Friday, Jon Krohn learns why Fortune 500 execs are so frustrated with the tools and how they can work their way up the “slope of enlightenment” towards effective AI. Hear why AI takeup hasn’t so far gone to plan in the corporate world and what that world needs from AI to encourage greater business engagement. Additional materials: www.superdatascience.com/892 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Martin Brunthaler talks to Jon Krohn about founding Adverity, a data analytics platform for marketing that simplifies integrating data from multiple sources and crunching them into actionable insights. Learn how Adverity became a data analytics powerhouse serving multiple industries, and why Martin thinks AI will strengthen rather than diminish the job market for data scientists, data analysts, and machine learning engineers. Additional materials: www.superdatascience.com/891 Today’s episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI Factory with NVIDIA Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:52) How Martin co-founded Adverity (14:26) The features of Adverity (39:24) If data analysts, data scientists, and ML engineers should worry about Adverity making their job redundant (48:29) Martin’s predictions for the future for data analysts and data scientists (51:39) Martin’s tips for success as a CTO
In this week’s Five-Minute Friday, Jon Krohn reveals highlights from Stanford University’s AI Index Report. Released a few weeks ago by the Institute for Human-Centered AI, this annual report details the incredible technical advances, policies, and investments in artificial intelligence. Hear which models achieve the best performance relative to their size, in what scenarios top AI systems can outperform humans (and when humans still outperform AI), and more in Jon’s five key takeaways. Additional materials: www.superdatascience.com/890 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Founder of CEEK’s Mary Spio talks to Jon Krohn about how the platform contributes to the emerging community of digital creators with its blockchain-powered virtual experiences. Hear how Mary got her first investors for CEEK and how it is used across industries as diverse as education, entertainment, aviation, and healthcare. Additional materials: www.superdatascience.com/889 This episode is brought to you by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:42) What CEEK is and the multiple industries it serves (38:47) How Mary developed VR headsets to reduce nausea experienced by women headset users (42:10) The growing potential for immersive experiences (44:36) How to mitigate the risks of immersive-experience misuse (51:56) Mary’s tips for career success
Mike Pell speaks to Jon Krohn about The Microsoft Garage, a program that drives the culture of innovation at the tech multinational, and how listeners can apply their principles to foster innovation in their workplace. In this Five-Minute Friday, you’ll hear more about Microsoft’s approaches to agentic AI, the future of human-AI collaboration in the workplace, and why experimentation and curiosity are critical skills for the future of work. Additional materials: www.superdatascience.com/888 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jon Krohn speaks to John Roese about the promise of multi-agent teams for business, the benefits of agentic AI systems that can identify and complete tasks independently, and how these systems demand new authentication, authorization, security and knowledge-sharing standards. They also discuss how to use AI to refine project ideas down to a core business need, as well as the new and emerging careers in the tech industry and beyond, all thanks to AI. Additional materials: www.superdatascience.com/887 This episode is brought to you by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:54) Why ROI is the most important aspect of an AI-driven project (14:06) Why high-impact AI projects trigger a flywheel of success (23:32) The future of agentic systems (30:28) How to manage agentic systems at scale (46:36) The disruptive nature of quantum computing
Our In Case You Missed It episode for April has clips on NVIDIA’s and Dell’s product and service offers including an overview of NVIDIA’s GPUs, AI Enterprise, and its microservices. You’ll also hear about AWS’ focus on bringing choice to customers and the incredible power of its Graviton CPU, how Zerve opens access to AI deployment, Merck KGaA, Darmstadt, Germany’s multi-chip integration, and why reliance on the cloud might soon become a practice of times past. Additional materials: www.superdatascience.com/886 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jeroen Janssens and Thijs Nieuwdorp are data frame library Polars’ greatest advocates in this episode with Jon Krohn, where they discuss their book, Python Polars: The Definitive Guide, best practice for using Polars, why Pandas users are switching to Polars for data frame operations in Python, and how the library reduces memory usage and compute time up to 10x more than Pandas. Listen to the episode to be a part of an O’Reilly giveaway! Additional materials: www.superdatascience.com/885 This episode is brought to you by Trainium2, the latest AI chip from AWS, by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (07:44) Why Jeroen and Thijs wrote Python Polars: The Definitive Guide (21:54) Best practices in Polars (25:55) Why Polars has so many users (34:32) The benefits of the Great Tables package (51:06) Jeroen and Thijs’ partnership with NVIDIA and Dell for Python Polars: The Definitive Guide
Model Context Protocol (MCP) is Anthropic’s hottest tool, with over 1,000 community-built MCP servers in operation by February alone. In this Five-Minute Friday, Jon Krohn explains what took so long for users to catch on: Anthropic released MCP in November 2024. Hear more about the buzz behind MCP, its applications, and how easy it is to get started. Additional materials: www.superdatascience.com/884 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Returning after the “Super Bowl of AI”, NVIDIA GTC, Sama Bali and Logan Lawler talk to Jon Krohn about their respective work at tech giants NVIDIA and Dell. Sama and Logan discuss the next-gen Blackwell GPUs to their collaboration with Dell in launching Pro-Max PCs specially designed to take on heavy computational workloads as well as the incredible performance of GB 10 and GB 300 workstations, and the widening accessibility of AI developer tools and models. Additional materials: www.superdatascience.com/883 This episode is brought to you by ODSC, the Open Data Science Conference, by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (07:29) About Dell’s Pro Max PCs (14:01) Why having a Blackwell GPU from Nvidia is a great option for those new to training and deploying AI models (36:47) When it makes sense for a data scientist to switch from a Unix to a Windows based system (46:33) Logan’s and Sama’s predictions for AI
This week’s five-minute Friday heads to the Netherlands to find out more about Dutch company ASML, the brains behind the lithography machines that build AI chips. Jon Krohn walks through how ASML came to dominate the market, where they’re headed next, and how ASML’s complex machines shape AI chips as well as the very future of AI. Additional materials: www.superdatascience.com/882 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Emily Webber speaks to Jon Krohn about her work at Amazon Web Services, from its Annapurna Labs-developed Nitro System, a foundational technology that can enhance securities and performance in the cloud and how Trainium2 became AWS’ most powerful AI chip with four times the compute of Trainium. Hear the specs of AWS’s chips and when to use them. Additional materials: www.superdatascience.com/881 This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (08:36) Emily’s work on AWS’ SageMaker and Trainium (23:54) How AWS Neuron lets builders tailor their approach to using frameworks (29:07) Why using an accelerator is better than using a GPU (35:29) The key differences between AWS Trainium and AWS Trainium2 (52:45) How to select between AWS Trainium and AWS Trainium2
First developed in China, Manus AI and DeepSeek have made great waves on an international scale. Sought-after for their cost-effectiveness compared to US-made tech, Manus AI and DeepSeek are quickly becoming dominant technologies inside the country. In this five-minute Friday, Jon Krohn asks: Do these technologies warrant the huge amount of resources spent on them by multiple industries in China, and what makes hype become a mainstay? Additional materials: www.superdatascience.com/880 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Greg Michaelson speaks to Jon Krohn about the latest developments at Zerve, an operating system for developing and delivering data and AI products, including a revolutionary feature allowing users to run multiple parts of a program’s code at once and without extra costs. You’ll also hear why LLMs might spell trouble for SaaS companies, Greg’s ‘good-cop, bad-cop’ routine that improves LLM responses, and how RAG (retrieval-augmented generation) can be deployed to create even more powerful AI applications. Additional materials: www.superdatascience.com/879 This episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:00) Zerve’s latest features (35:26) How Zerve’s built-in API builder and GPU manager lowers barriers to entry (40:54) How to get started with Zerve (41:49) Will LLMs make SaaS companies redundant? (52:29) How to create fairer and more transparent AI systems (56:07) The future of software developer workflows
AI stacks, AGI, training neural networks, and AI authenticity: Jon Krohn rounds up his interviews from March with this episode of “In Case You Missed It”. In his favorite clips from the month, he speaks to Andriy Burkov (Episode 867), Natalie Monbiot (Episode 873), Richmond Alake (Episode 871) and Varun Godbole (Episode 869). Additional materials: www.superdatascience.com/878 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
NPUs, AIPC, and Dell’s growing suite of AI products: Shirish Gupta speaks to Jon Krohn about neural processing units and what makes them a go-to tool for AI inference workloads, reasons to move your workloads from the cloud and to your local devices, what the mnemonic AIPC stands for and why it will soon be on everyone’s lips, and he offers a special intro to Dell’s new Pro-AI Studio Toolkit. Hear about several real-world AIPC applications run by Dell’s clients, from detecting manufacturing defects to improving efficiencies for first responders, massively supporting actual life-or-death situations. Additional materials: www.superdatascience.com/877 This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:28) What neural processing units (NPUs) are (23:53) About Dell Pro AI Studio (35:03) Use cases for Dell Pro AI Studio (45:16) How AI development workflows and applications will change (49:01) About Dell’s AI factory ecosystem
Small, simple, accessible: Hugging Face makes a huge contribution to the agentic AI wave with its smolagents. Jon Krohn explores how this small-but-mighty new Python library can act as the best personal assistant you never had. Hear about its features and use cases in this five-minute Friday. Additional materials: www.superdatascience.com/876 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Why are semiconductors so essential in this digital age, and how are they made? Jon Krohn speaks to electronics CEO Kai Beckmann about Merck KGaA, Darmstadt, Germany’s intricate manufacturing process, how we can use AI to develop materials that power next-gen AI technologies, and how a chip with the processing power of the human brain might one day be able to run on the power of a low-watt light bulb. Additional materials: www.superdatascience.com/875 This episode is brought to you by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (06:26) How Merck KGaA, Darmstadt, Germany supports groundbreaking developments in AI (13:42) Material science’s biggest challenges for AI (29:55) What heterogeneous integration is (34:37) How optical tech influences the electronics industry (49:04) Navigating upturns and downturns in the semiconductor industry (53:08) How AI regulations benefit humanity
In this Five-Minute Friday, Jon Krohn talks baseball. For decades, coaches have relied on player performance stats to make in-game decisions and refine their season strategies. Now, AI led by Statcast is taking baseball strategy even further, massively broadening analytics data to include pitch, swing and catch trajectories, spin rates, biomechanical information, player matchups, and how to enhance player performances. Listen to the episode to find out what other industries can learn from the “data-friendly” sport of baseball. Additional materials: www.superdatascience.com/874 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Natalie Monbiot is an independent advisor and collaborator for projects that concern the “virtual human”, and she is “going all in on the virtual human economy”. Jon Krohn speaks to Natalie about these new ventures, how to mitigate the divide between AI users and nonusers, and how anyone can collaborate with AI without compromising their own creativity. Additional materials: www.superdatascience.com/873 This episode is brought to you by the Dell AI Factory with NVIDIA, by Trainium2, the latest AI chip from AWS and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (07:21) Natalie’s influences for her work (18:30) Will machines surpass human intelligence? (29:08) Using LLMs as collaborators and partners (40:15) How platforms demand user engagement and time (56:54) Natalie Monbiot at Wizly
In this five-minute Friday, Jon Krohn looks into Microsoft’s recent release of Majorana 1, a new quantum processing unit that uses topological qubits, a step away from the fragile qubits currently in use. Get Jon’s thoughts about this “transistor for the quantum age”, potential applications for quantum computing, and why this marks an exciting future for data science and machine learning practitioners. Additional materials: www.superdatascience.com/872 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Agentic AI, AI success strategies, and why flexibility will be so important to keep up with the AI market: Jon Krohn talks to Richmond Alake about the NoSQL database MongoDB, including why it’s a great addition to your toolkit for developing (agentic) AI applications, with a look under the hood at its native vector database. Richmond also talks about why he expects multi-agent AI architectures to go mainstream in 2025. Additional materials: www.superdatascience.com/871 This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:10) How Richmond became a Staff Developer Advocate (07:40) How NoSQL database differs from a relational database (16:50) The advantages of working with the cloud-based MongoDB Atlas (32:26) Richmond’s predictions for agentic AI (40:38) How to create an effective AI strategy
In this Five-Minute Friday, Jon Krohn looks into what he considers the world’s most powerful research tool to date, OpenAI’s Deep Research. Find out how OpenAI trained Deep Research to compile literature reviews of limitless topics, what similar tools are on the market, and where Jon sees the tool as having real-world value including how he uses it daily. Additional materials: www.superdatascience.com/870 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jon Krohn talks to Varun Godbole about AI prompt engineering, generative wisdom, and AI generalists in this episode all about the interrelationships between humans and AI. Additional materials: www.superdatascience.com/869 This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (10:44) Using deep learning to predict breast cancer (15:55) All about Varun’s Tuning Playbook (29:56) On the explosion of interest and news about AI and data science (46:35) About Varun’s Wise AI
How to start a successful tech company, and how you can get started with DBT, TabPFN and BAML: Jon Krohn rounds up his favorite moments from February in this episode of “In Case You Missed It”. Additional materials: www.superdatascience.com/868 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
The realities of Agentic AI, AGI, and chatbots that don’t hallucinate: Andriy Burkov talks to Jon Krohn about AI in 2025. Best known for his concise machine learning modelling books, author and AI influencer Andriy Burkov also talks about his latest publication in the series, The Hundred-Page Language Learning Models Book. Additional materials: www.superdatascience.com/867 This episode is brought to you by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (07:38) Andriy’s “triology” of books on machine learning (29:32) On the limitations of AI agents (41:12) On the prospect of artificial general intelligence (AGI) (54:24) On developing a chatbot that doesn’t hallucinate (01:10:07) On open-weight and open-source LLMs
Jon Krohn addresses a question for the ages: How close are we, really, to Jurassic Park? Dallas-based biotech company Colossal Biosciences is developing technology that aims to return previously extinct animals like the dodo and woolly mammoth to earth and, crucially, pull many others like the white rhino back from the brink of extinction. Additional materials: www.superdatascience.com/866 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jon Krohn talks to Cal Al-Dhubaib about the extraordinary success of AI and machine learning solutions provider Pandata, his ironclad hack for any company to define their core values, and how to attract and secure loyal clients. Cal thinks tech professionals make two critical mistakes in their careers: The first is that they too-often enjoy being the gatekeepers of their work rather than educating their clients and coworkers as to the details of their projects and why it benefits the company. The second is that tech professionals don’t show vulnerability, whether that means not knowing a topic or not fully understanding how a business works. This issue, Cal says, can spell the difference between a startup’s success and failure. Learn how tech startups can make an ironclad strategy for their future in this episode of The SuperDataScience Podcast. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (09:32) How to scale a successful data science consultancy (22:25) How Pandata navigates highly regulated environments (27:59) How to tackle tech illiteracy in business (36:32) What skills Cals looks for in new hires (35:56) How to sell on a tech company Additional materials: www.superdatascience.com/865
Jon Krohn investigates OpenAI’s new release, o3-mini, in this five-minute Friday, where he walks through the reasoning model’s capabilities and performance, cross-examining them against other major-league players, DeepSeek-R1, GPT-4o and Claude 3.5 Sonnet. Additional materials: www.superdatascience.com/864 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jon Krohn talks tabular data with Frank Hutter, Professor of Artificial Intelligence at Universität Freiburg in Germany. Despite the great steps that deep learning has made in analysing images, audio, and natural language, tabular data has remained its insurmountable obstacle. In this episode, Frank Hutter details the path he has found around this obstacle even with limited data by using a ground-breaking transformer architecture. Named TabPFN, this approach is vastly outperforming other architectures, as testified by a write up of TabPFN’s capabilities in Nature. Frank talks about his work on version 2 of TabPFN, the architecture’s cross-industry applicability, and how TabPFN is able to return accurate results with synthetic data. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (05:57) All about the TabPFN architecture (21:27) Use cases for Bayesian inference (35:07) On getting published in Nature (44:03) How TabPFN handles time series data (51:52) All about Prior Labs Additional materials: www.superdatascience.com/863
In this episode of “In Case You Missed It”, Jon Krohn shares his favorite clips from the last four weeks. He talks to Azeem Azhar, Florian Neukart, Kirill Eremenko, Hadelin de Ponteves, and Brooke Hopkins on what’s in store for AI in 2025, from quantum computing and customizable tools to handy checklists and how the mathematics of exponentials can help us keep our heads about the swift advancement of AI. Additional materials: www.superdatascience.com/862 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
How does a CrossFit winner, bobsledder and swimmer go on to have a glittering career in data analytics and engineering? Colleen Fotsch talks to Jon Krohn about transitioning into very different career paths, how sports gave her the competitive mindset she needed for success in data science, and seeing the niche role of analytics engineering as a bridge between data engineering and analysis. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (05:49) Colleen’s path from athlete to data analyst (1:14:41) About the data build tool (DBT) (1:22:51) Colleen’s work at CHG Healthcare (1:32:45) How Colleen and Tia-Clair got started with PRVN GO Additional materials: www.superdatascience.com/861
DeepSeek-curious? This Five-Minute Friday is for you! Jon Krohn investigates the overwhelming overnight success of this new LLM, the product of a Chinese hedge fund. DeepSeek is a market newcomer, and yet it runs shoulder to shoulder with behemoths from OpenAI, Anthropic and Google like it’s all in a day’s work. Additional materials: www.superdatascience.com/860 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
In this week’s guest interview, Vaibhav Gupta talks to Jon Krohn about creating a programming language, BAML, that helps companies save up to 30% on their AI costs. He explains how he started tailoring BAML to facilitate natural language generation interactions with AI models, how BAML helps companies optimize their outputs, and he also lets listeners into Boundary’s hiring process. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:53) What BAML stands for (14:33) Making a prompt engineering a serious practice (18:00) How BAML helps companies (23:30) Using retrieval-augmented generation (RAG) (43:09) How to get a job at Boundary Additional materials: www.superdatascience.com/859
Are you an Account Executive with experience in the technology sector? In this Five-Minute Friday, Jon Krohn tells listeners about an exciting new role that has opened up at The SuperDataScience Podcast. Additional materials: www.superdatascience.com/858 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Brooke Hopkins speaks to Jon Krohn about technology’s new frontiers in AI agents, how these agents will impact society, work and our creative enterprises, and what this might mean for our data-driven future. You will learn how Coval, a simulation and evaluation platform for AI voice and chat agents, helps companies balance precision and scalability while making few concessions on the way. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (07:49) What Coval does and how the platform works (21:16) Coval’s workflows (37:40) The future of AI agents (46:28) The metrics to evaluate performance (55:08) How close we are to achieving AI agent autonomy Additional materials: www.superdatascience.com/857
Get excited: The fastest-growing jobs in the US are AI Engineer and AI Consultant. In this Five-Minute Friday, Jon Krohn looks into the reports that reveal this job growth, and the trends any data scientist and AI professional will want to watch in 2025. Additional materials: www.superdatascience.com/856 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
How can we use AI to solve global problems like the environmental crisis, and how will future AI start to manage increasingly complex workflows? Famed futurist Azeem Azhar talks to Jon Krohn about the future of AI as a force for good, how we can stay mindful of an evolving job market, and Azeem’s favorite tools for automating his workflows. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (05:43) Azeem Azhar’s vision for AI’s future (14:16) How to prepare for technological shifts (20:35) How to be more like an AI-first company (38:46) The tools Azeem Azhar uses regularly (50:09) The benefits and risks of transitioning to renewable energy (1:09:28) Opportunities in the future workplace Additional materials: www.superdatascience.com/855
Join Jon Krohn as he unpacks Ray Kurzweil’s six epochs of intelligence evolution, a fascinating framework from The Singularity is Nearer. From the origins of atoms and molecules to the transformative future of brain-computer interfaces and cosmic intelligence, Jon explores how each stage builds upon the last. This quick yet profound journey reveals how humanity is shaping the Fifth Epoch—and hints at what’s next for intelligence in our universe. Additional materials: www.superdatascience.com/854 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Kirill Eremenko and Hadelin de Ponteves AI educators, whose courses have been taken by over 3 Million students, sit down with Jon Krohn to talk about how foundation models are transforming businesses. From real-world examples to clever customization techniques and powerful AWS tools, they cover it all. bravotech.ai - Partner with Kirill & Hadelin for GenAI implementation and training in your business. Mention the “SDS Podcast” in your inquiry to start with 3 complimentary hours of consulting. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (07:00) What are foundation models? (15:45) Overview of the foundation model lifecycle: 8 main steps. (29:11) Criteria for selecting the right foundation model for business use. (41:35) Exploring methods to customize foundation models. (53:04) Techniques to modify foundation models during deployment or inference. (01:11:00) Introduction to AWS generative AI tools like Amazon Q, Bedrock, and SageMaker. Additional materials: www.superdatascience.com/853
AI security, LLM engineering, how to choose the best LLM, and tech agnosticism: In our first “In Case You Missed It” of 2025, Jon Krohn starts the year with a round-up of our favorite recent interview moments. He selects from interviews with Andrew Ng, Ed Donner, Eiman Ebrahimi, Sadie St Lawrence, and Greg Epstein, covering the latest in AI development, touching on agentic workflows, promising new roles in AI, and what blew our minds last year. Additional materials: www.superdatascience.com/852 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Are our passwords safe, even with the increasing accessibility of quantum computing? Florian Neukart, Chief Product Officer at Terra Quantum AG, thinks so. In this episode, he outlines the three key elements of quantum-safe security. He speaks to Jon Krohn about the resourceful applications of quantum computing and workarounds for the demands of quantum computing on operational times and cooling systems. And if you’re interested in making the switch to quantum computing from machine learning, he also explores what you need (and don’t need) to make change happen. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (17:12) The real-world applications of quantum computing (23:35) The chips needed for quantum computing (31:18) How quantum computing meets key business challenges (46:33) The ethical challenges of quantum technology (49:28) How to become proficient in quantum computing (1:01:21) The future of quantum computing Additional materials: www.superdatascience.com/851
A new year often draws our focus towards fresh approaches to the way we work and structure our day. For Jon Krohn, the continuous calendar gives him a realistic and uninterrupted overview of his time. Plus, it’s customizable and free! In this episode, Jon also shares his plans and priorities for the New Year, and he recommends how you can assess and achieve your goals for the year; critical advice for anyone who wants to create manageable and sustainable milestones in 2025. Additional materials: www.superdatascience.com/850 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Sadie St Lawrence returns for her 4th annual prediction episode on the Super Data Science Podcast. Together with host Jon Krohn, they reflect on 2024’s most transformative trends—like agentic AI and enterprise AI monetization—and predict what's coming in 2025, from AI-driven science to the skills data scientists need to stay ahead. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:30) 2024 AI trend recap (19:23) Comeback of the year: Google (27:29) Wow moment of the year (40:20) Looking ahead to 2025 Additional materials: www.superdatascience.com/849
In this Five-Minute Friday episode, Jon Krohn reflects on 2024’s monumental year in AI, highlighting the rapid rise of generative AI and its impact across industries. From functional coding breakthroughs to independently acting AI agents, we explore the transformative power of these advancements and the promise they hold for 2025. Jon shares optimism for the future of AI and humanity's ability to harness it for the greater good. Additional materials: www.superdatascience.com/848 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Ed Donner co-founded AI-driven recruitment platform, Nebula.io, with The SuperDataScience Podcast’s host, Jon Krohn. Ed and Jon reminisce about how they launched their company, the growing opportunities for data scientists, how to choose an LLM, and today’s top technical terms in AI. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (11:15) What an AI engineer does (19:23) Defining today’s key terms in AI: RAG, fine tuning, agentic. (27:09) How to select an LLM (49:41) Pitting LLMs against each other in a game (53:14) What to do once you’ve selected an AI model Additional materials: www.superdatascience.com/847
In this Five-Minute Friday, Jon Krohn speaks to Anu Jain, CEO of Nexus Cognitive, and Mahesh Kumar, CMO of Acceldata. They talk about the importance of updating data, especially for predictive models that make key financial decisions for a company, as well as the current state of data governance and why it’s overdue its own update. Additional materials: www.superdatascience.com/846 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Discover how technology has become the modern belief system shaping our world. Greg Epstein, author of Tech Agnostic: How Technology Became the World's Most Powerful Religion, and Why It Desperately Needs a Reformation, draws striking parallels between tech culture and traditional faiths. From AI's "singularity" echoing prophetic narratives to Silicon Valley’s promises of salvation through innovation, Greg uncovers the profound influence of technology on our lives. He challenges us to rethink blind faith in progress, focus on genuine human connection, and navigate a future where ethics and empathy guide innovation. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (08:30) How can someone cultivate connection without religion? (15:49) Social media as a new form of community (17:00) Tech's transformation into a religion (56:08) How to set boundaries with tech (01:01:32) The singularity as a religious narrative (01:19:53) Transhumanism and effective altruism as tech cults (01:15:00) Defining tech agnosticism (01:26:55) Prioritizing human connection in a tech-driven world Additional materials: www.superdatascience.com/845
In this episode of “In Case You Missed It”, in which we round up our favorite moments from the previous month of interviews, Jon Krohn asks his guests about the future of recruitment and job applications, the multiple pathways to a career in AI, the potential of AI in developing proteins for improved healthcare, and how “AI celebrity” doesn’t necessarily equate to “AI expert”. Additional materials: www.superdatascience.com/844 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
What’s holding your AI projects back from success? Dr. Eiman Ebrahimi, CEO of Protopia AI and former NVIDIA scientist, takes us on a fascinating journey through the challenges of AI data security and enterprise scalability. Learn how to escape "proof of concept purgatory," unlock profitable AI solutions, and tackle the trade-offs between cost, speed, and security. Plus, discover how the philosophy of Alan Watts can inspire innovation and drive meaningful change in the world of AI. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (02:53) Protopia’s role in AI data security and privacy (11:45) The functionality behind Stained Glass Transform (22:20) Eiman’s journey from NVIDIA to founding Protopia (25:37) Challenges enterprises face with ROI on AI projects (36:40) Multi-tenancy in AI systems (55:37) Stained Glass Transform’s privacy-preserving capabilities (01:09:31) Emerging trends in AI (01:14:55) Alan Watts’ philosophies and their link to entrepreneurship Additional materials: www.superdatascience.com/843
In this Five-Minute Friday, Jon interviews Chris Bennett and Joseph Balsamo on the importance of flexibility in the way we deploy AI models, Dell’s brand positioning in the AI space, and whether GenAI’s business applications stand up to the hype. Additional materials: www.superdatascience.com/842 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
In this special episode recorded live at ScaleUp:AI in New York, Jon Krohn speaks to Andrew Ng in response to his conference talk on smart agentic AI workflows. Jon follows up with Andrew about smart agentic workflows and when to use them, how businesses should direct their efforts in investing in AI, and the new ways that AI tools can process visual and unstructured data. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (06:13) How to weigh up cost and effectiveness in new AI workflows (12:08) The crucial elements for building effective vision AI applications (15:34) How large vision models might transform global industries (18:40) How to mitigate risk in people not verifying accuracy in answers generated by agents Additional materials: www.superdatascience.com/841
What do AI, robotics, and premium wine grapes have in common? Everything, as it turns out. In this episode, we explore viticultural robotics a revolutionary project combining machine learning, spectroscopic sensors, and VR-controlled robotics to tackle one of agriculture’s trickiest challenges: harvesting delicate wine grapes worth over $6,000 per tonne. From vineyards in the UK to cutting-edge labs, discover how these innovations could transform not just viticulture but the entire future of precision agriculture. Additional materials: www.superdatascience.com/840 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jess Ramos is redefining success in data analytics. As the Founder of Big Data Energy and a Senior Data Analyst at Crunchbase, she’s mastered the art of salary negotiation, built a massive social media following, and turned her passion for data into a thriving personal brand. She reveals how she doubled her salary in under a year, created her own SQL course, and advocates for women in STEM. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:42) How Jess got her start in data analytics (09:14) Why SQL is the most critical skill for data professionals (11:46) How Jess more than doubled her salary in less than a year (20:16) Tips for transitioning from a data job to creating your own business (31:20) The various routes to a career in data science (39:13) How Jess challenges STEM stereotypes Additional materials: www.superdatascience.com/839
Jon Krohn heads to Lisbon for an interview hosted by Bella Shing, Chapter Lead for Light Dao. He shares the stage with Regarding Consciousness podcast host, Jennifer Hill, where the three discuss AI philosophy and consciousness. Additional materials: www.superdatascience.com/838 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Deepali Vyas, Global Head of Applied Intelligence (Data Science & AI) and FinTech at Korn Ferry, talks to Jon Krohn about the best ways for data science and AI professionals to get seen and hired. Hear why video, not text, is the future of recruitment, how to get over camera shyness, and how to make a winning impression on job recruiters. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (09:49) On using GenAI to get hired (27:44) The future of video in recruitment (40:36) Tips for the camera-shy (44:15) How Fearless+ started (54:51) How AI helps organizations to ensure equity (57:43) Green-flag behaviors at work Additional materials: www.superdatascience.com/837
Economist and social-impact innovator Dr. Nat Ware reveals how our expectations shape happiness and why chasing it often leaves us unfulfilled. He shares insights on the “hedonic treadmill” and the effects of constant comparison on our well-being. Find out how to build a more meaningful life by making memories, taking chances, and focusing on genuine connections. Additional materials: www.superdatascience.com/836 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
AI systems are evolving rapidly, and in this episode, Bryan McCann, CTO of You.com, explains You.com’s unique approach to search, the impact of AI-driven research, and the game-changing potential of AI agents. With a background in natural language processing and philosophy, Bryan joins Jon Krohn to share a fresh perspective on where AI is headed and what it means for the future of work and scientific discovery. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (03:55) How You.com’s “do engine” approach connects users to multiple language models (11:34) How AI systems at You.com generate optimized, intent-driven queries for better results (28:39) You.com’s focus on automated workflows sets it apart from other platforms (31:31) AI agents in You.com, with Bryan predicting they’ll outnumber people by 2025 (41:49) Bryan’s path to unified AI models that can perform diverse tasks (50:40) Early experiments with alignment in AI that influenced modern transformers (01:04:45) Bryan’s research on controllable text generation (01:11:27) Language models applied to protein generation, linking text and biology sequences Additional materials: www.superdatascience.com/835
Jon Krohn starts the month with his round-up of favorite clips from the previous month. Hear from Bradley Voytek, Natalie Monbiot, Luca Antiga, Chad Sanderson, and Ritchie Vink in conversations about the ongoing potential of AI. Additional materials: www.superdatascience.com/834 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Martin Goodson speaks to Jon Krohn about what he would add to his viral article “Ten Ways Your Data Project is Going to Fail”, why practitioners always need to be present at AI policy discussions, and Evolution AI’s breakthroughs in computer vision and NLP. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (04:25) What Evolution AI does (11:41) How to maintain accuracy in large infrastructures (21:22) How to cultivate innovation and creativity while meeting market demands (24:27) Potential knowledge gaps for machine learning practitioners (30:57) Martin’s viral article, “Ten Ways Your Data Project is Going to Fail” (59:54) Strategies for the UK to become a key player in AI Additional materials: www.superdatascience.com/833
Host Jon Krohn unpacks Dario Amodei’s vision of a techno-utopia in his essay Machines of Loving Grace, where “Powerful AI” takes center stage. Amodei, CEO of Anthropic, imagines a future where AI doesn’t just assist but actively shapes fields like healthcare, economics, and governance with unmatched intelligence and autonomy. Jon explores the possibilities and challenges of this AI-driven future, asking how close we are to seeing these revolutionary shifts and what they mean for society. Additional materials: www.superdatascience.com/832 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
PyTorch Lightning is revolutionizing the AI landscape, and Dr. Luca Antiga, CTO of Lightning AI, joins host Jon Krohn to explain how. In this episode, they explore the tools pushing AI development forward, from Lightning Studios to Lit-Serve, and discuss the game-changing rise of small language models that challenge industry giants with precision and speed. Luca also shares his vision for developers in an AI-enhanced world, where coding meets creativity and collaboration with intelligent tools. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: How Lightning AI's open-source tools make AI development faster [11:30] The rise of small language models and how they'll rival LLMs [37:47] Luca's journey from biomedical imaging to deep learning pioneer [52:03] How AI will transform software developer tasks [1:03:05] Additional materials: www.superdatascience.com/831
Geoffrey Hinton and Sir Demis Hassabis: The Nobel Prize committee is an achievement of the highest order, awarding physicists, chemists, physiologists, medical practitioners, writers, pacifists and economists perhaps the greatest honor in their respective fields. In this week’s Five-Minute Friday, Jon Krohn discusses how two AI pioneers came to win prizes in chemistry and physics. Additional materials: www.superdatascience.com/830 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Neuroscientist Bradley Voytek outlines to Jon Krohn the incredible use of data science and machine learning in his research and how recent discoveries in action potentials and neurons have completely skyrocketed the field to a new understanding of the brain and its functions. You’ll also hear what Bradley thinks is most important when hiring data scientists and his contributions to Uber’s algorithm when it was still a startup. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: Breakthroughs in brain region communication [04:08] The future of brain research and MedTech [35:24] The libraries and software used at the Halicioglu Data Science Institute [45:11] Brain rhythm as a diagnostic tool [1:02:58] Bradley’s curriculum structure at UC San Diego [1:12:21] How Uber applies data science [1:20:07] Additional materials: www.superdatascience.com/829
The citizen data scientist: Fact or fiction? Jon Krohn holds a conversation across episodes in this Five-Minute Friday, with today’s guest Keith McCormick, in part responding to Nick Elprin’s interview in episode 811: Scaling Data Teams Effectively. Additional materials: www.superdatascience.com/828 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Ritchie Vink, CEO and Co-Founder of Polars, Inc., speaks to Jon Krohn about the new achievements of Polars, an open-source library for data manipulation. This is the episode for any data scientist on the fence about using Polars, as it explains how Polars managed to make such improvements, the APIs and integration libraries that make it so versatile, and what’s next for this efficient library. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: Why Polars is so efficient [05:20] Polars’ easy integration with other data-processing tools [21:23] Eager vs lazy executive in Polars [32:15] Polars’ data processing of large- and small-scale datasets [38:28] Ritchie’s plans to scale his company [46:14] Upcoming features in Polars [58:06] Additional materials: www.superdatascience.com/827
Next-gen IDEs, efficiency-boosting open-source Python libraries, and changes in hiring for data scientists: This episode of In Case You Missed It gives you our best clips of September’s interviews, hosted by Jon Krohn. Additional materials: www.superdatascience.com/826 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Data contracts are redefining data quality and governance, and Chad Sanderson, CEO of Gable.ai, joins host Jon Krohn to explain how they can transform your data strategy. He breaks down what data contracts are, how they shift data quality checks closer to production, and why they’re essential for reducing data debt. Chad also highlights how better alignment between data producers and consumers can elevate data reliability and tackle change-management challenges in modern organizations. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: What data contracts are and how they define expectations for data quality [03:16] What data contracts look like [09:09] The common misconceptions about data quality when implementing AI [12:55] Chad’s Chief Operator role at Data Quality Camp [19:46] How “shifting left” improves data reliability by addressing issues early [24:17] Why data professionals still struggle with data quality [30:31] How data debt forms and why it leads to complex, inefficient architectures [35:53] How will the role of human oversight evolve in ensuring data quality? [47:12] How can data teams leverage storytelling? [52:33] Additional materials: www.superdatascience.com/825
Llama 3.2 brings a new era of AI innovation with lightweight models tailored for on-device applications and powerful vision models for handling complex image inputs. Host Jon Krohn explores how this release pushes the boundaries of open-source AI, making it more accessible and versatile for developers. He also covers the Llama Stack toolkit, designed to streamline deployment, and Llama Guard 3, Meta’s latest content moderation solution. With extensive support from major cloud and hardware partners, Llama 3.2 is set to unlock groundbreaking possibilities for AI across mobile and beyond. Tune in to hear more. Additional materials: www.superdatascience.com/824 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Virtual humans are rewriting the rules of digital communication and reshaping entire industries. This week, Jon Krohn welcomes Natalie Monbiot, Head of Strategy at Hour One, to shed light on how AI avatars are revolutionizing L&D and e-commerce by turning traditional training and product listings into captivating, presenter-led content. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • How do you create a virtual being? [10:55] • Reid Hoffman's avatar [13:40] • The virtual human economy [31:07] • Virtual human societies [51:24] • Virtual humans and creative expression [56:35] • Challenges in maintaining transparency [01:00:22] Additional materials: www.superdatascience.com/823
NotebookLM, Google’s latest AI tool, takes content creation to a new level. This week, Jon Krohn shares how the platform transformed his 200-page dissertation into a fascinating 11-minute podcast. Discover how AI can turn vast amounts of information into engaging and digestible content, opening up new possibilities for content creation. Additional materials: www.superdatascience.com/822 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Marck Vaisman speaks to Jon Krohn about his paradigm for understanding core data practitioner types. Hear Marck detail the four data practitioner personas that he has identified in his research, why he believes the roadmaps that influencers like to promote as surefire ways to a data science career don’t work in practice, and why the term “data scientist” is still so elusive and hard to recruit for. This episode is brought to you by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • How Marck started his work in defining data science roles [08:06] • The relationship between the four data practitioner personas [15:26] • About Marck’s “menu” for effective data science [40:43] • How recruiters can hire the best data scientist for the job [59:31] Additional materials: www.superdatascience.com/821
Jon Krohn takes OpenAI’s new models (o1-preview and o1-mini) for a spin in this Five-Minute Friday, learning their key strengths and limitations, and how the o1 series may represent yet another landmark for generative AI. Additional materials: www.superdatascience.com/820 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
SuperDataScience veteran and Udemy teacher Luka Anicin is on the podcast to talk about his brand-new course, “PyTorch: From Zero to Hero”, available exclusively on superdatascience.com. Host Jon Krohn asks Luka why he feels that every data scientist should consider PyTorch as their default Python library, and why “keeping it simple” can secure the success of a machine learning project. This episode is brought to you by AWS Inferentia and AWS Trainium, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • About the PyTorch library [03:29] • Why PyTorch became so popular [25:24] • How to increase accuracy and efficiency in PyTorch [31:49] • How to utilize transfer learning [35:44] • Why real-world projects are essential to data scientists [41:10] • About Datablooz [46:49] Additional materials: www.superdatascience.com/819
Experts from AI and data science discuss the impact and benefits of decentralization, the importance of structuring AI systems in business, and why knowing the basics will always matter for data engineers. Listen to Shingai Manjengwa (episode 809), Daniel Hulme (episode 807), Jerry Yurchisin (episode 813) and Nick Elprin (episode 811) explore a future world of work that rewards continuing learners, sets tasks for the people best suited to complete them rather than those whose job titles reflect the spec, and applies a fleet of ‘AI agents’ to solve complex business tasks. Additional materials: www.superdatascience.com/818 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Dr. Julia Silge, Engineering Manager at Posit, introduces the brand-new Positron IDE, perfect for exploratory data analysis and visualization. She also lays out her top picks for LLMs that boost coding efficiency and discusses when traditional NLP methods might be the smarter choice over LLMs. Plus, Julia highlights some must-know open-source libraries that make managing MLOps easier than ever. Tune in for insights that every data scientist, ML engineer, and developer will find useful. This episode is brought to you by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • Overview of Posit and Positron IDE [05:20] • How the needs of a data scientist differ from those of a software developer [10:54] • How to contribute to the open-source Positron [19:50] • MLOps and Vetiver: Tools for deploying and maintaining ML models [37:01] • Natural Language Processing (NLP) and the Tidyverse approach [50:34] • The role of AI and LLMs in data science education [1:24:18] Additional materials: www.superdatascience.com/817
Jon Krohn takes on a listener's challenge to explain his work in data science to his 94-year-old grandmother, Annie. This heartwarming conversation covers what data is, the role of a data scientist, and breaks down artificial intelligence (AI) and artificial general intelligence (AGI) in simple terms. The episode provides a fresh take on how to communicate complex topics to a lay audience, offering both clarity and insight. Additional materials: www.superdatascience.com/816 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Polars, Python, Narwhals, Rust, and Pandas: Marco Gorelli talks to Jon Krohn about the many ways to use the newest data libraries available, the joys of open-source development, and the best method to win prizes in forecasting competitions. This episode is brought to you by AWS Inferentia and AWS Trainium, by Babbel, the science-backed language-learning platform, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • When to use Polars vs Pandas [08:26] • How Polars optimizes string operations and data processing [20:08] • Where Narwhals outstrips Polars and Pandas [48:37] • The benefits of using Altair [55:21] • Addressing the lack of women in data science [1:09:58] • How to win a forecasting competition [1:16:58] Additional materials: www.superdatascience.com/815
As summer winds down, this episode shifts focus from the usual tech discussions to something more personal: reflecting on the importance of balancing work with life’s simple pleasures. While the world of data science and AI continues to evolve rapidly, it's essential to remember that true success isn't just about professional milestones. It’s also about cherishing the moments that make life meaningful. Tune in for a brief but impactful reflection on how to redefine success to include not just achievements, but also the everyday joys that often go unnoticed. Additional materials: www.superdatascience.com/814 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jerry Yurchisin from Gurobi joins Jon Krohn to break down mathematical optimization, showing why it often outshines machine learning for real-world challenges. Find out how innovations like NVIDIA’s latest CPUs are speeding up solutions to problems like the Traveling Salesman in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • The Burrito Optimization Game and mathematical optimization use cases [03:36] • Key differences between machine learning and mathematical optimization [05:45] • How mathematical optimization is ideal for real-world constraints [13:50] • Gurobi’s APIs and the ease of integrating them [21:33] • How LLMs like GPT-4 can help with optimization problems [39:39] • Why integer variables are so complex to model [01:02:37] • NP-hard problems [01:11:01] • The history of optimization and its early applications [01:26:23] Additional materials: www.superdatascience.com/813
In this episode of Five-Minute Friday, Jon Krohn investigates published findings from the startup Sakana AI and its paper’s co-authors from the University of Oxford, the University of British Columbia and the Vector Institute in Toronto. These authors explore the potential of The AI Scientist, a framework that could change the way we conduct scientific research forever. Additional materials: www.superdatascience.com/812 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Nick Elprin talks to Jon Krohn about how and when to scale a data science team and its workflows to secure a company’s commercial viability. You’ll also hear how to launch your own data science startup and why it’s so important to understand that AI tools are not one-size-fits-all. This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • How Nick served enterprises with his AI startup, Domino Data Lab [05:36] • About the Navy’s own mine detection models [17:43] • The hype surrounding GenAI [30:35] • How AI platforms integrate with business strategies [39:49] • When it’s time to integrate an AI tool into your business [51:12] • Why Nick started Domino Data Lab [1:03:53] Additional materials: www.superdatascience.com/811
Self-driving cars are here, and Jon Krohn is breaking down the five levels of automation that could change driving forever. From full human control at Level 0 to cars that drive themselves in any condition at Level 5, get the real story on what these levels mean. With firsthand insights from a recent autonomous vehicle experience, this episode cuts through the buzz and tells you what’s coming next. Additional materials: www.superdatascience.com/810 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Agentic AI is revolutionizing the tech landscape, and Shingai Manjengwa from ChainML is here to tell us why. Discover how AI agents are becoming an integral part of our lives, automating tasks like travel bookings and daily inspiration. Shingai explains the power of multi-agent systems, where AI agents collaborate to solve complex challenges, and highlights how blockchain technology is enhancing AI transparency and trust. Plus, get an inside look at ChainML’s innovative Theoriq protocol and the groundbreaking Council Analytics tool. This episode is brought to you by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • What A.I. agents are [10:51] • How blockchain technology helps humans trust A.I. agents [18:27] • The Theoriq protocol developed by ChainML [34:05] • How Council Analytics lets you “speak” to their dataset with natural language [39:00] • A future of multi-agent systems [50:42] • Challenges and risks associated with agentic AI [1:04:17] Additional materials: www.superdatascience.com/809
Advice for emerging data scientists, the latest in model merging, and how GenAI can supercharge your creativity: Host Jon Krohn gives us his highlights from a month of interviews, packed with tips from some of the leading names in data science and beyond. Guests include Daliana Liu, Charles Duhigg, Charles Goddard, Rosanne Liu and Andrey Kurenkov. Additional materials: www.superdatascience.com/808 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
The singularity could soon be upon us. The PESTLE framework, developed by this episode’s guest Daniel Hulme, expresses not one but six types of singularity that could occur: political, environmental, social, technological, legal and economic. Jon Krohn and Daniel Hulme discuss how each of these singularities could bring good to the world, aligning with human interests and pushing forward progress. They also talk about neuromorphic computing, machine consciousness, and applying AI at work. This episode is brought to you by AWS Inferentia and AWS Trainium, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • About the six singularities [03:43] • How the singularity could improve life on earth [09:01] • The credibility of AI experts [32:51] • How the decentralization of technology could benefit earth [43:14] • How AI might enhance creativity [1:04:33] Additional materials: www.superdatascience.com/807
Llama 3.1 is here, and it’s a game-changer. Meta’s latest AI model, especially the massive 405B variant, finally brings an open-source option to compete with giants like OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet. While Meta didn’t fully open-source everything, the availability of "open weights" is a strategic move to shake up the AI landscape. The model boasts an impressive 128,000-token context window and multilingual support in eight languages. Meta is also focusing on responsible AI development with tools like Llama Guard 3 for content moderation. This release is more than just a tech upgrade—it's about democratizing AI and sparking innovation across industries. How will you leverage Llama 3.1 to make a real impact? Tune into this week’s FMF episode and let’s explore the future with this latest AI development together. Additional materials: www.superdatascience.com/806 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Become a Supercommunicator! New York Times bestselling author Charles Duhigg, known for The Power of Habit and Smarter Faster Better, gets real about mastering communication in this episode. Discover insights from his latest book, Supercommunicator, where he reveals how to align conversation styles for deeper connections, handle conflicts effectively, and why AI can't replicate the emotional depth of human interactions. This episode is brought to you by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • The inspirations behind Supercommunicator [03:41] • The three types of conversations: Practical, emotional, and social conversations [05:22] • The matching principle: Align communication styles for better connection [10:36] • What is neural entrainment: Achieve a mind meld through synchronized brain activity [13:22] • The series of steps/principles to connect with someone [24:39] • How to avoid or de-escalate conflict conversations [31:07] • The impact of GenAI on conversations: How AI mimics dialogue but lacks emotional depth [45:24] Additional materials: www.superdatascience.com/805
Solar power now provides 6% of the world's electricity, thanks to rapid growth. Host Jon Krohn discusses the factors driving this rise, the challenges ahead, and how AI and data science are optimizing solar technologies. Tune in for insights on the future of solar power, and don't forget to like, share, and subscribe! Additional materials: www.superdatascience.com/804 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Daliana Liu is a big name in data science teaching, and she has always been generous in sharing everything she knows about getting a job in data science. In this episode, she continues to extend her generosity, helping listeners define their approach to achieving a fulfilling career in data science and tech. This episode is brought to you by AWS Inferentia and AWS Trainium, by Babbel, the science-backed language-learning platform, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • Common career challenges for data scientists [34:57] • Advice for people who don’t know where to go in their career [48:05] • How to build resilience and protect against Imposter Syndrome [1:06:23] • Skills that data scientists should develop today [1:39:17] • The future of the data science and AI job market [1:46:55] Additional materials: www.superdatascience.com/803
How to grab investor interest with your AI startup idea, revisiting algorithms, and helping practitioners ensure AI safety with regulatory frameworks and beyond: This month, you missed a whole bunch of great interviews. But don’t worry, Jon Krohn is here to recap all the best bits for you! Additional materials: www.superdatascience.com/802 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Merged LLMs are the future, and we’re exploring how with Mark McQuade and Charles Goddard from Arcee AI on this episode with Jon Krohn. Learn how to combine multiple LLMs without adding bulk, train more efficiently, and dive into different expert approaches. Discover how smaller models can outperform larger ones and leverage open-source projects for big enterprise wins. This episode is packed with must-know insights for data scientists and ML engineers. Don’t miss out! Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • Explanation of Charles' job title: Chief of Frontier Research [03:31] • Model Merging Technology combining multiple LLMs without increasing size [04:43] • Using MergeKit for model merging [14:49] • Evolutionary Model Merging using evolutionary algorithms [22:55] • Commercial applications and success stories [28:10] • Comparison of Mixture of Experts (MoE) vs. Mixture of Agents [37:57] • Spectrum Project for efficient training by targeting specific modules [54:28] • Future of Small Language Models (SLMs) and their advantages [01:01:22] Additional materials: www.superdatascience.com/801
The SuperDataScience Podcast is celebrating its 800th episode! Host Jon Krohn speaks to his grandmother, Annie, about growing up at a time when so many technologies we take for granted today were yet to be developed. Listen in to hear Annie’s experience of the changes in technology across 94 years and how she and her family fared in 1940s Ukraine with no electricity or running water. Additional materials: www.superdatascience.com/800
No-code games with GenAI, the creative possibilities of LLMs, and our proximity to AGI: In this episode, Jon Krohn talks to Andrey Kurenkov about what turned him from an AGI skeptic to a positivist. You’ll also hear about his wildly popular podcast “Last Week in AI” and how the NVIDIA-backed startup Astrocade is helping videogame enthusiasts to create their own games through generative AI. A must-listen! This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • All about The Gradient and Last Week in AI [10:42] • All about Astrocade and Andrey’s role at the startup [24:35] • Balancing UX and creative control at Astrocade [42:00] • The creative possibilities of LLMs [1:04:15] • The rapid emergence of AGI [1:10:31] Additional materials: www.superdatascience.com/799
Claude 3.5 Sonnet, Anthropic’s newest model, is making waves in the AI community. This mid-size model outshines the larger Claude 3 Opus in tasks like code generation, content creation, and document summarization, and it’s twice as fast. In this episode of The Super Data Science Podcast, Jon Krohn discusses its top-notch performance across benchmarks like MMLU, GPQA, and HumanEval, along with its improved machine vision capabilities. Plus, learn about the new Artifacts UI feature, which makes managing generated content easier by displaying outputs side-by-side with inputs. Tune in to find out why Claude 3.5 Sonnet is setting new standards in AI. Additional materials: www.superdatascience.com/798 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Dr. Rosanne Liu, Research Scientist at Google DeepMind and co-founder of the ML Collective, shares her journey and the mission to democratize AI research. She explains her pioneering work on intrinsic dimensions in deep learning and the advantages of curiosity-driven research. Jon and Dr. Liu also explore the complexities of understanding powerful AI models, the specifics of character-aware text encoding, and the significant impact of diversity, equity, and inclusion in the ML community. With publications in NeurIPS, ICLR, ICML, and Science, Dr. Liu offers her expertise and vision for the future of machine learning. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • How the ML Collective came about [03:31] • The concept of a failure CV [16:12] • ML Collective research topics [19:03] • How Dr. Liu's work on the “intrinsic dimension” of deep learning models inspired the now-standard LoRA approach to fine-tuning LLMs [21:28] • The pros and cons of curiosity-driven vs. goal-driven ML research [29:08] • Discussion on Dr. Liu's research and papers [33:17] • Character-aware vs. character-blind text encoding [54:59] • The positive impacts of diversity, equity, and inclusion in the ML community [57:51] Additional materials: www.superdatascience.com/797
Want to feel optimistic about your day? In this Friday episode, Simon Kuestenmacher talks to Jon Krohn about demography: What it is, why it’s so important, and why its forecasts should give us reason to hope for a better future. In an increasingly globalized world, and with an aging population in countries with the biggest GDPs, demography is more valuable than ever. Additional materials: www.superdatascience.com/796 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Gina Guillaume-Joseph talks to Jon Krohn about the data and regulatory frameworks set to transform the AI industry and why that’s important to anyone working with data. This episode offers a solid path to understanding AI regulation’s past, present and future. Gina walks listeners through the AI Bill of Rights, the NIST AI Risk Framework and the MITRE ATLAS threat model. This episode is brought to you by AWS Inferentia and AWS Trainium, by Crawlbase, the ultimate data crawling platform, and by Babbel, the science-backed language-learning platform. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • What “responsible AI” means [08:14] • Why the federal government should be behind AI regulation [12:22] • The US vs EU on AI regulation [18:46] • About the AI Bill of Rights [26:14] • About MITRE and the MITRE Atlas [37:19] • What a systems engineer does [54:11] Additional materials: www.superdatascience.com/795
Trends in open-source AI: Join Jon Krohn and a panel of data science icons as they discuss the most exciting and concerning developments in open-source AI. Hear insights from Drew Conway, Jared Lander, Emily Zabor, and JD Long on the transformative potential of AI and its future impact. Additional materials: www.superdatascience.com/794 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Bayesian methods take the spotlight in this episode with Alex Andorra, co-founder of PyMC Labs, and Jon Krohn. Learn how Bayesian techniques handle tough problems, make the most of prior knowledge, and work wonders with limited data. Alex and Jon break down essentials like PyMC, PyStan, and NumPyro libraries, show how to boost model efficiency with PyTensor, and talk about using ArviZ for top-notch diagnostics and visualizations. Plus, get into advanced modeling with Gaussian Processes. This episode is brought to you by Crawlbase, the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • Practical introduction to Bayesian statistics [04:54] • Definition and significance of epistemology [17:52] • Explanation of PyMC and Monte Carlo methods [27:57] • How to get started with Bayesian modeling and PyMC [34:26] • PyMC Labs and its consulting services [50:50] • ArviZ for post-modeling diagnostics and visualization [01:02:23] • Gaussian processes and their applications [01:09:02] Additional materials: www.superdatascience.com/793
Jon Krohn shares his favorite clips from May. Hear how Navdeep Martin is spearheading a company to tackle the climate crisis, why Sol Rashidi and Demetrios Brinkmann find nailing job titles so necessary in the fast-paced industries of tech and AI, and get the latest on embeddings with Luis Serrano. Additional materials: www.superdatascience.com/792 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Reinforcement learning through human feedback (RLHF) has come a long way. In this episode, research scientist Nathan Lambert talks to Jon Krohn about the technique’s origins of the technique. He also walks through other ways to fine-tune LLMs, and how he believes generative AI might democratize education. This episode is brought to you by AWS Inferentia (go.aws/3zWS0au) and AWS Trainium (go.aws/3ycV6K0), and Crawlbase (crawlbase.com), the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Why it is important that AI is open [03:13] • The efficacy and scalability of direct preference optimization [07:32] • Robotics and LLMs [14:32] • The challenges to aligning reward models with human preferences [23:00] • How to make sure AI’s decision making on preferences reflect desirable behavior [28:52] • Why Nathan believes AI is closer to alchemy than science [37:38] Additional materials: www.superdatascience.com/791
The experts reveal their top open-source R libraries with us live from the New York R Conference! This Super Data Science Podcast episode features an exclusive panel with data science trailblazers Drew Conway, Jared Lander, Emily Zabor, and JD Long. They share their favorite R libraries and valuable insights to enhance your data science practice. Additional materials: www.superdatascience.com/790 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Machine Learning for Wind Energy is front and center in this episode as Jon Krohn is joined by Dr. Jason Yosinski, CEO of Windscape AI. Dr. Yosinski brings to light the latest ML advancements sparking significant changes in renewable energy. Tune in for a comprehensive review of these cutting-edge technologies and their expansive impact on the industry and the environment's well-being. This episode is brought to you by Crawlbase, the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Enhancing predictability in wind energy with ML [04:52] • Data utilization from wind turbines by energy providers [11:41] • Jason's journey into wind energy [17:55] • Landing the right startup idea [22:47] • Visualizing neural networks with the Deep Vis Toolbox [31:29] • Extreme event forecasting at Uber vs. nowcasting at Windscape AI [45:13] • Discoveries from Loss Change Allocation research [47:48] • Engaging with Jason's ML Collective [59:46] • Traits of successful AI entrepreneurs [1:10:26] Additional materials: www.superdatascience.com/789
Multi-agent systems could mark a significant turning point in generative AI. From mastering increasingly complex tasks to getting LLMs to collaborate, in this Five-Minute Friday, Jon Krohn discusses the systems that are working to bridge the remaining gaps left by the latest large language models (LLMs). Additional materials: www.superdatascience.com/788 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
MLOps, how to build an online community, and tools for scaling LLMs: In this episode, Demetrios Brinkmann speaks to Jon Krohn about the similarities and differences between LLMOps, MLOps and DevOps, and why this should matter to companies looking to hire such engineers. You will also hear how to get involved in the MLOps community wherever you are in the world, and how you can start developing great products with the available tools. This episode is brought to you by AWS Inferentia (go.aws/3zWS0au) and AWS Trainium (go.aws/3ycV6K0). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • What MLOps is [03:51] • About LLMOps [12:06] • About LlamaIndex and Ollama [18:29] • Insights from Demetrios’ MLOps survey [20:49] • Guidance for using third-party APIs [40:18] • Recommendations for building an online community in tech and AI [47:07] Additional materials: www.superdatascience.com/787
Learn about the six keys to data science success as host Jon Krohn welcomes back Kirill Eremenko, the mastermind behind SuperDataScience. Kirill shares his top insights on data science careers, from building strong portfolios to leveraging mentors and hands-on labs. With over 2.7 million students, his advice is a must-hear for aspiring and experienced data scientists alike.Additional materials: www.superdatascience.com/786Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Dr. Luis Serrano from the Serrano Academy reveals how to make Math and Quantum ML accessible, tackles the challenges of teaching A.I. to beginners, and explores the power of embeddings in enterprise applications. Explore the future of Quantum Machine Learning and the latest trends in AI, including multimodality and autonomous systems.This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• How math and AI can be made easy to understand [05:21]• The three major categories of learners [16:21]• Why embeddings are the most important component of LLMs [26:19]• How semantic search differs from a traditional keyword search [29:57]• The most exciting emerging application areas for AI [42:41]• The promising application areas for Quantum Machine Learning [49:18]Additional materials: www.superdatascience.com/785
Aligning LLMs: How can we teach pre-trained LLMs to hold a conversation and learn new information from each other? This was where Sinan Ozdemir began his investigation into aligning LLMs. In this episode, he talks to Jon Krohn about the limitations of definitions for LLMs, training LLMs, and whether it is possible to train an LLM without alignment.Additional materials: www.superdatascience.com/784Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.