
Kabir's Tech Dives
I'm always fascinated by new technology, especially AI. One of my biggest regrets is not taking AI electives during my undergraduate years. Now, with consumer-grade AI everywhere, I’m constantly discovering compelling use cases far beyond typical ChatGPT sessions.
As a tech founder for over 22 years, focused on niche markets, and the author of several books on web programming, Linux security, and performance, I’ve experienced the good, bad, and ugly of technology from Silicon Valley to Asia.
In this podcast, I share what excites me about the future of tech, from everyday automation to product and service development, helping to make life more efficient and productive.
Please give it a listen!
Episodes
251 episodes
🤖 Comprehensive Compilation of Free AI Learning Resources
This episode introduces several free online courses and resources aimed at democratizing artificial intelligence education for various audiences, from non-technical individuals to experienced programmers. Elements of AI<...
•
Season 3
•
Episode 24
•
16:09

🗣️ State of AI Voice Generation
This episode offers insights into the landscape of AI voice generation and text-to-speech technology as of 2025. They explore various platforms like PlayAI, NaturalReader, Murf AI, ElevenLabs, and others, highlighting their featur...
•
Season 3
•
Episode 22
•
13:55

⚔️ Global AI Landscape: Competition, Innovation, and Impact
This episode discusses DeepSeek, a Chinese AI startup that has rapidly gained recognition for its innovative and cost-efficient AI models, posing a challenge to established US tech companies like OpenAI. DeepSeek's open-sourc...
•
Season 3
•
Episode 23
•
24:04

🤖 AI Music Generation: Tools, Ethics, and Industry Impact
Several sources from late 2024 and early 2025 discuss the burgeoning field of AI music generation, highlighting various tools and their capabilities. Articles from SoundGuys and whatplugin Reviews evaluate and compare speci...
•
Season 3
•
Episode 21
•
16:04

🎬 AI Video Generation Tools and Trends
This episode explore the rapidly advancing landscape of AI video generation tools. Several YouTube transcripts and articles discuss a variety of platforms, both free and paid, capable of tasks like text-to-video creation, image animation, reali...
•
Season 3
•
Episode 20
•
17:06

🗣️ Dia: New Open Source Text-to-Speech Model
Nari Labs, a two-person startup, has launched Dia, an open-source text-to-speech model. This model, boasting 1.6 billion parameters, is designed to generate natural-sounding dialogue from text, even incorporating emotional tones and nonverbal c...
•
Season 3
•
Episode 19
•
11:31

🐬 DolphinGemma: AI Decodes Dolphin Communication
Google AI has developed DolphinGemma, a new AI model, to help scientists at the Wild Dolphin Project decode the complex communication of Atlantic spotted dolphins. Trained on decades of dolphin vocalization data, DolphinGemma i...
•
Season 3
•
Episode 18
•
10:44

🛡️ Microsoft SFI April 2025 Security Progress Report
This episode is about Microsoft's April 2025 progress report on its Secure Future Initiative (SFI), a comprehensive, multiyear effort to enhance the security of its products and services. The report highlights advancements across var...
•
Season 3
•
Episode 17
•
17:46

LLM Advancements, Applications, and Industry Impact in 2024-2025
This episode explores the current landscape and future trajectory of large language models (LLMs) and generative AI. One document details ten practical applications of LLMs in 2024, highlighting tools like ChatGPT and Grammarly, while another i...
•
Season 3
•
Episode 16
•
16:48

🎬 AI's Impact and Innovations in Video Production
This episode explores the burgeoning field of AI in video production, highlighting advancements like Runway Gen-4's precise camera controls and the emergence of powerful generative models such as OpenAI's Sora and the open-source Open-Sora proj...
•
Season 3
•
Episode 15
•
28:48

🤖 AI Business Risks, Cost Reduction, Defense, Training, and Manus AI
This episode introduce Manus AI, an autonomous AI agent from a Chinese startup, highlighting its ability to execute complex tasks with minimal user input, setting it apart from tools like ChatGPT and DeepSeek. Manus AI boasts mult...
•
Season 3
•
Episode 14
•
19:09

🎬 One-Minute Video Generation via Test-Time Transformer Training
Researchers introduced Test-Time Training (TTT) layers to enhance the ability of pre-trained Diffusion Transformers to generate longer, more complex videos from text. These novel layers, inspired by meta-learning, allow the model's hidde...
•
Season 3
•
Episode 13
•
14:55

The Next Token and Beyond: Unraveling the LLM Enigma
Yes, I can certainly provide a long and detailed elaboration on the topics covered in the sources, particularly focusing on LLM-generated text detection and the nature of LLMs themselves.The emergence of powerful Large Language Models (L...
•
19:27

🤖 AI and Machine Learning: A Multi-Source Overview
This episode provides a comprehensive exploration into the realm of Artificial Intelligence (AI) and Machine Learning (ML), specifically within the context of educational environments. At its core, AI is defined as the simulation of human intel...
•
Season 3
•
Episode 12
•
17:05

🤖 AI Trends and Innovations for 2025
This episode explores the anticipated trajectory of artificial intelligence in 2025, highlighting key trends impacting various sectors. AI agents, capable of autonomous reasoning and action, are a prominent focus across multiple sources....
•
Season 3
•
Episode 11
•
17:52

🥊 AI Giants Compete for College Students
OpenAI and Anthropic are actively competing to become the primary AI tool for college students. Both companies have recently unveiled initiatives aimed at higher education, with Anthropic introducing Claude for Education and OpenAI makin...
•
Season 3
•
Episode 6
•
9:06

🤖 Therabot: AI Chatbot Shows Mental Health Therapy Benefits
Dartmouth researchers conducted a clinical trial of their AI-powered therapy chatbot, Therabot, and found significant mental health improvements in participants with depression, anxiety, and eating disorder risks. The study showed sympto...
•
Season 3
•
Episode 10
•
13:34

📉 Microsoft Adjusts AI Data Center Growth Amid New Trends
Microsoft is reportedly scaling back its ambitious AI data center expansion plans. This decision follows the emergence of new, more cost-effective AI model development methods, particularly from Chinese companies. These methods demonstra...
•
Season 3
•
Episode 9
•
12:52

🚀 Efficient and Portable Mixture-of-Experts Communication
A team of AI researchers has developed a new open-source library to enhance the communication efficiency of Mixture-of-Experts (MoE) models in distributed GPU environments. This library focuses on improving performance and portability...
•
Season 3
•
Episode 8
•
16:59

🤝 Vana: User-Owned AI Models from Decentralized Data
Vana, a decentralized platform originating from an MIT project, aims to shift control of data used for AI training back to individual users. Frustrated by the current model where tech companies profit from user data, Vana allows individuals to ...
•
Season 3
•
Episode 5
•
11:09
.png)
🤖 AI and Copyright: US Copyright Office Report
In a January 2025 report, the U.S. Copyright Office addresses the copyrightability of works created using artificial intelligence. This second part of a broader study examines the level of human contribution necessary for AI-generated ou...
•
Season 3
•
Episode 7
•
20:47

Unleashing Local AI on Your Mac Studio - From Ollama to DeepSeek
Are you intrigued by the power of AI but concerned about privacy or cloud costs? In this episode, dive into the exciting world of running Large Language Models (LLMs) directly on your Mac, iPhone, and iPad! We'll explore how tools like Ollam...
•
Season 3
•
Episode 4
•
14:11

🤖 Agentic AI Courses and Learning Resources
This episode provides information about agentic AI and AI agent courses available in 2025. The courses cover topics like AI fundamentals, building AI agents, prompt engineering, and strategic implementation, catering to diverse skill levels and...
•
Season 3
•
Episode 3
•
15:05

🎭 DreamActor-M1: Hybrid Guided Holistic Human Image Animation
This episode is also about a research paper introducing DreamActor-M1, a new realistic human image animation framework. This DiT-based method utilizes hybrid guidance combining facial representations, 3D head spheres, and body skeletons for fin...
•
Season 3
•
Episode 2
•
19:21

🤖 DreamActor-M1: Human Image Animation
DreamActor-M1 is a new framework for animating human images based on a diffusion transformer, utilizing a hybrid guidance system. This approach enables more precise control over the entire body, adapts to different image scales, and main...
•
Season 3
•
Episode 1
•
14:53
