Episode 5 | 57 Min | February 05

Exploring reinforcement learning with MIT Professor Vivek Farias

Share on

Engaging topics at a glance

00:16:50
What is Reinforcement Learning
00:20:10
Reinforcement Learning for LLMs
00:24:00
How do you reward your model?
00:33:00
Revealed preferences v/s just a few individuals doing that
00:36:00
AI model training AI in the future?
00:40:18
Methodologies other than Reinforcement Learning
00:43:10
Considerations when in the Reinforcement Learning with Human Feedback (RLHF) Phases
00:48:10
About Cimulate

“Exploring Reinforcement Learning” with guest Vivek Farias, Professor, MIT, discusses what role reinforcement learning has to play in this world of Artificial Intelligence.

Learning systems with humans date back to almost 5,000 years ago. And these learning systems have what allowed us to progress as a society. Being able to teach other people what we know and share knowledge has been the foundational pillars of our evolution and civilization. And interestingly, these learning systems are not unique to just humans. Animals also have these learning systems. When you look at orcas, dolphins, the higher-order intelligent animals spend time training and teaching their young ones. In the last 50 to 60 years, we have not just been teaching humans how to learn, but we have been teaching machines how to learn. And this artificial intelligence area has benefited from our understanding of these learning systems.

The guest started with highlighting the importance of acknowledging uncertainty and balancing between exploiting what is known and exploring to learn more about the environment. This problem is referred to as a "multi-arm bandit problem" and is considered fundamental in reinforcement learning, where the goal is to optimize actions in an environment.

When looking at it specifically for Large Language Models (LLMs) the role of Reinforcement Learning. RL has played the central role in building general purpose chatbots that are based on LLMs. Because the resulting model that has been trained on data might not give you the refined output that you are expecting from it.

When discussing about rewards and losses in reinforcement learning phase, it came out that the way we structure rewards and penalties for AI models greatly influences their reliability, how they interact with public and the accountability.

Overall deploying AI involves a balance. Backend deployment offers some level of predictability, while front-end deployment is uncertain. Successful business must experiment and capitalize in both aspects.

Production Team
Arvind Ravishunkar, Ankit Pandey, Chandan Jha

Guest

Vivek Farias

Professor at MIT

Vivek Farias is a Professor at MIT where he teaches hands on deep learning. He is the Co-Founder and CTO of Cimulate. He also Co-Founded Celect (acquired by Nike). He received his PhD in electrical engineering from Stanford University.

Host

Arvind Ravishunkar

GM, Wipro, lab45 think tank, Strategist & Berkeley MBA

Go beyond the Unpacked AI: Explore references

Reinforcement Learning 101

Learn More

Our podcast on your favourite platforms. Explore now!

Episode 9 | 56 Min | February 05

Building prototypes and pilots using generative AI with Mark Donavon, Nestlé Purina

Share on

Engaging topics at a glance

00:11:20
Introduction
00:16:30
How does the market mindset help in conceptualizing ideas?
00:19:00
Consumer research, design, and prototype for AI-based products
00:22:40
Data sources and models used in early product development
00:25:35
When to feed data into AI model?
00:28:32
When to take the prototype to production?
00:37:35
ML models used during prototyping
00:40:46
Generative AI in your products
00:43:05
Testing early models
00:45:25
Grounding models
00:47:20
Key insights

Join us in this episode where our guest Mark Donavon, Head of Digital Strategy and Ecosystem Development at Nestle Purina PetCare shares his real-life experiences and insights to explore what it takes to understand and build prototypes and pilots using AI.

This podcast gives insights into how a pet care organization harnesses the power of AI and IoT technologies to enhance pet welfare. The discussion centers on innovative problem-solving and the considerable potential for AI applications in the pet care domain.

The podcast opens by highlighting the importance of allowing technology to be driven by problems and needs rather than dictating solutions. The emphasis is on understanding specific user groups and comprehending the challenges faced by pet owners. Instead of beginning with existing technology and searching for problems to solve, their approach revolves around understanding the needs of end users and subsequently exploring how technology can address these issues. This user-centric approach is a cornerstone of their organization, reinforcing their commitment to developing products tailored to pet owners' requirements.

The conversation then pivots to the process of understanding user needs. The organization conducts consumer research, with variations across regional divisions. Each division maintains its own consumer insight team working closely with external agency partners to gather research data. Their digital team collaborates with these divisions, allowing them to access consumer insights that might not be uncovered through traditional research methods. This highlights the adaptability of their company and the synergistic relationship between divisions.

The podcast proceeds to discuss the practical application of AI and IoT technologies. An example is presented: a smart litter box equipped with IoT capabilities that utilizes AI to provide valuable insights. The aim is to detect early signs of kidney disease in cats, a common yet often undiagnosed ailment. The organization saw an opportunity to intervene earlier by identifying changes in a cat's bathroom behavior that correlate with an increased risk of the disease. This innovative device provides pet owners and veterinarians with early warning indicators, effectively transforming the approach to cat health.

The speaker underscores how the smart litter box is revolutionizing pet care. Traditional practices often involve diagnosing the disease at advanced stages, making it challenging for veterinarians to do more than manage symptoms. However, this device alerts pet owners to subtle behavioral changes, enabling early intervention and potentially life-saving treatments.

The journey toward developing this ground-breaking device is then explored. It began with a low-fidelity prototype, using a simple mechanical device to record data when a cat entered the litter box. This provided initial insights into behavioral patterns. Subsequently, more sensors and technologies were integrated, resulting in the current iteration of the smart litter box. The speaker stresses the importance of combining various sensors to collect comprehensive data for diagnosing specific behaviors and patterns in cats, thus facilitating early detection of health issues.

The podcast also delves into AI models, which are employed to gain a deeper understanding of pet behavior. Early prototypes collected data on behavioral patterns but could not interpret the cat's actions within the litter box. To address this limitation, machine learning models were incorporated. These models were trained to distinguish between various behaviors, such as urination, defecation, and digging. This enhanced the system's ability to provide meaningful insights, enabling the early detection of potential health issues by interpreting the pet's actions within the litter box.

In response, a point is made regarding the flexibility and adaptability of AI models. It's crucial to allow machine learning models to evolve and adapt since pets may exhibit diverse behaviors. This flexibility aligns with the organization's commitment to accumulating extensive data and generating high-quality training data to enhance their systems.

The discussion then touches upon the challenges of introducing innovative technologies within an established company. The speaker describes the initial hurdles they faced when convincing management to invest in these new technological directions. Skepticism and questions about the impact on pet food sales were common concerns. Yet, by presenting real-world data, success stories, and tangible outcomes, they were able to build a compelling case and garner support for their projects over time.

Production Team
Arvind Ravishunkar, Ankit Pandey, Chandan Jha

Guest

Mark Donavon

Head of Digital Strategy and Ecosystem Development at Nestlé Purina PetCare

Mark Donavon serves as the Head of Digital Strategy and Ecosystem Development at Nestlé Purina PetCare. With a dynamic leadership style and a history of delivering innovative products for global business growth, he specializes in digital pet products and AI-driven insights from pet data. His mission is to enrich the lives of pets and their beloved owners.

Host

Arvind Ravishunkar

GM, Wipro, lab45 think tank, Strategist & Berkeley MBA

Go beyond the Unpacked AI: Explore references

Digitalization and R&D in Petcare with Mark Donavon, Nestle Purina Petcare

Learn More

AI-Powered Litterbox System Offers New Standard of Care for Cat Owners

Learn More

Unpacked with Arvind Ravishunkar

Our podcast on your favourite platforms. Explore now!

Latest podcasts

Uncovering GenAI tools and infrastructure with Rajat Monga, Co-Founder, TensorFlow

How to deploy AI sustainably with Dr. Eng Lim Goh, SVP at HPE

What you should know about LLM’s with Anupam Datta, Co-founder TruEra, and ex-CMU

Building prototypes and pilots using generative AI with Mark Donavon, Nestlé Purina

Are LLMs the Answer to Everything with Prof. Mausam, IIT Delhi

How AI will impact your business with Harvard Professor, Shikhar Ghosh

Develop GenAI Strategy for your organization with AI Scientist, Omid Bakhshandeh