Episode 5 | 57 Min | March 13

Exploring reinforcement learning with MIT Professor Vivek Farias

Engaging topics at a glance

00:16:50
What is Reinforcement Learning
00:20:10
Reinforcement Learning for LLMs
00:24:00
How do you reward your model?
00:33:00
Revealed preferences vs. just a few individuals doing that
00:36:00
AI model training AI in the future?
00:40:18
Methodologies other than Reinforcement Learning
00:43:10
Considerations in the Reinforcement Learning from Human Feedback (RLHF) phase
00:48:10
About Cimulate

In “Exploring Reinforcement Learning”, guest Vivek Farias, Professor at MIT, discusses the role reinforcement learning has to play in the world of artificial intelligence.

Human learning systems date back almost 5,000 years, and they are what has allowed us to progress as a society. Being able to teach other people what we know and to share knowledge has been a foundational pillar of our evolution and civilization. Interestingly, these learning systems are not unique to humans. Animals have them too: orcas, dolphins, and other higher-order intelligent animals spend time training and teaching their young. In the last 50 to 60 years, we have been teaching not just humans how to learn, but machines as well, and the field of artificial intelligence has benefited from our understanding of these learning systems.

Reinforcement Learning is the agent interacts with the world, the world does something to the agent.

– Vivek Farias

The guest began by highlighting the importance of acknowledging uncertainty and of balancing exploitation of what is already known against exploration to learn more about the environment. This trade-off is captured by the “multi-armed bandit problem”, which is considered fundamental in reinforcement learning, where the goal is to optimize actions in an environment.
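To make the explore/exploit trade-off concrete, here is a minimal sketch of an epsilon-greedy agent on a simulated bandit. It is an illustration only, not a method described in the episode: the function name `run_epsilon_greedy`, the reward probabilities, and the choice of epsilon-greedy as the strategy are all assumptions made for the example.

```python
import random


def run_epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy agent on a Bernoulli multi-armed bandit.

    true_means: hypothetical win probability of each arm (unknown to the agent).
    epsilon:    fraction of steps spent exploring a random arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how often each arm was pulled
    estimates = [0.0] * n_arms   # running average reward per arm
    total_reward = 0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random arm to learn more about the environment.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pull the arm that currently looks best.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = 1 if rng.random() < true_means[arm] else 0
        counts[arm] += 1
        # Incremental update of the running average for this arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward


if __name__ == "__main__":
    arms = [0.2, 0.5, 0.8]  # made-up probabilities, for illustration only
    print("epsilon=0.1:", run_epsilon_greedy(arms, epsilon=0.1)[0])
    print("epsilon=0.0:", run_epsilon_greedy(arms, epsilon=0.0)[0])
```

With `epsilon=0.0` the agent never explores: every estimate starts at zero, so it keeps pulling the first arm and never finds out that another arm pays better. A small amount of exploration is what lets it discover the better arms.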

For Large Language Models (LLMs) specifically, RL has played a central role in building the general-purpose chatbots that are based on them, because a model that has only been trained on data might not give you the refined output you expect from it.

The idea is that, listen, there are so many uncertain things in my environment. If I, I don’t acknowledge uncertainty altogether, I may get into this trap where I never learn.

– Vivek Farias

The discussion of rewards and losses in the reinforcement learning phase brought out that the way we structure rewards and penalties for AI models greatly influences their reliability, how they interact with the public, and their accountability.

Overall, deploying AI involves a balance. Back-end deployment offers some level of predictability, while front-end deployment is uncertain. Successful businesses must experiment with and capitalize on both.

Production Team
Arvind Ravishunkar, Ankit Pandey, Chandan Jha

Guest

Vivek Farias

Professor at MIT

Vivek Farias is a Professor at MIT, where he teaches hands-on deep learning. He is the co-founder and CTO of Cimulate, and he also co-founded Celect (acquired by Nike). He received his PhD in electrical engineering from Stanford University.

Host

Arvind Ravishunkar

GM, Wipro, lab45 think tank, Strategist & Berkeley MBA

Go beyond the Unpacked AI: Explore references

Reinforcement Learning 101

Episode 1 | 36 Min | March 13

Why AI hallucinates and why it matters with Ankur Taly, scientist at Google

Engaging topics at a glance

00:00:20
Introduction
00:10:36
Why do models make mistakes and why is it called AI hallucinations?
00:13:31
How does a model know which relationships are meaningful and not?
00:16:12
Things enterprise leaders should keep in mind while deploying LLMs
00:18:14
How does grounding address these AI hallucinations?
00:21:53
How much is grounding going to solve the hallucination problem?
00:24:47
Does hallucinatory capability drive innovation?

Join us in this episode featuring Ankur Taly, Staff Research Scientist, Google, as we explore the concept of grounding of LLMs!

Machines are supposed to work without mistakes, just like a calculator does math correctly. But in the world of artificial intelligence, errors, often called 'AI hallucinations,' are common. This makes us wonder about these mistakes and the computer programs behind them. For businesses that use AI in their work, especially when dealing with customers, making sure AI works without errors is very important.

Understanding how AI makes decisions and being clear about its processes is very important. Business leaders need to be able to watch and explain how AI makes decisions. This will be crucial for using AI in their companies in the future.

To fight AI hallucinations, grounding is important. Grounding means making sure AI answers are based on real facts. This involves teaching AI systems using correct and reliable information and making them give answers that can be proven. Grounding stops AI from making things up or giving wrong information.

When businesses use LLMs (large language models) in their work, they should think about some important things. First, they need to use good data to teach AI because bad data can lead to wrong or unfair results. It's also important to have rules about how AI is used in the company to avoid causing harm or misusing AI information.

Businesses also need to keep an eye on AI's results to fix mistakes or wrong information. Having people check and filter AI's work ensures that it's correct and consistent. It's also important to teach employees and users about what AI can and can't do to avoid misunderstandings or misuse.

Even though AI hallucinations can be a problem, they can also have some positives. They can make people think creatively and find new solutions to tough problems. AI's imaginative ideas can be fun, offering new types of art and media. Plus, AI hallucinations can help with learning by making people think and talk about interesting topics.

Production Team
Arvind Ravishunkar, Ankit Pandey, Chandan Jha

Guest

Ankur Taly

Scientist at Google

Ankur Taly is currently a Staff Research Scientist at Google. He focuses on studying how AI works, especially in relation to grounding and AI hallucinations. He is deeply involved in AI explainability and co-authored Integrated Gradients, a widely used algorithm that helps explain how AI models function and make predictions.

Host

Arvind Ravishunkar

GM, Wipro, lab45 think tank, Strategist & Berkeley MBA

Go beyond the Unpacked AI: Explore references

Identifying and Mitigating the Security Risks of Generative AI

Explainable AI in industry: Practical challenges and lessons learned

Episode 7 | 45 Min | March 13

How AI will impact your business with Harvard Professor, Shikhar Ghosh

Engaging topics at a glance

00:10:30
Introduction
00:13:35
Why is AI so disruptive?
00:16:30
How can businesses and governments accept this new reality?
00:19:20
How should enterprise leaders approach the AI transformation?
00:21:40
New business models shaped with AI
00:27:15
Emotions, decisions, and algorithms
00:34:35
Are we ready yet?

Join us in this episode featuring Shikhar Ghosh, Professor, Harvard Business School, as we explore how AI can fundamentally impact business and society!

In the ever-evolving landscape of technology, artificial intelligence stands as a true disruptor, poised to reshape not only our businesses but also the very fabric of society. In a captivating podcast discussion with Shikhar Ghosh, Harvard Business School professor, we delve deep into the riveting world of AI, exploring why its impact is so seismic, how enterprise leaders should navigate this new frontier, the question of human relevance in the age of AI, and whether we are truly prepared for this transformative journey.

We will uncover the essence of AI's disruptive power and provide compelling insights into the sheer transformation that AI can herald.

Be prepared to be guided through the stormy seas of AI's influence on businesses. Our expert highlights the critical importance of a well-defined AI approach. Enterprise leaders must be agile and proactive, recognizing that AI is not merely a tool but a transformational force. We will discuss how to approach AI with an open mindset, viewing it as a catalyst for innovation rather than just a threat.

We will also see why leaders should maximize the upside of AI. This underscores the value of human-machine collaboration, emphasizing that AI augments human capabilities rather than replacing them entirely. It's a matter of harnessing AI's analytical prowess to inform decision-making and free up human resources for more creative and strategic pursuits.

One of the most intriguing segments of the podcast explores the question that lingers in the minds of many: Will humans remain relevant in the age of AI? The discussion offers nuances that business leaders can learn from as they work to embrace AI proactively, wisely, and effectively.

In a world teetering on the precipice of AI-driven transformation, this podcast offers a compelling exploration of why AI is the disruptive force of our era. It presents a narrative that transcends the technical jargon, making the topic accessible and engaging for both the tech-savvy and those new to the AI landscape. As we listen to Professor Ghosh’s insights, we are left with a resounding question: Will we embrace AI as a catalyst for positive change, or will we be swept aside by its inexorable tide of disruption? The answer may very well determine the fate of businesses and society as we know it. To find out more, tune in to the full podcast and embark on a journey into the future of AI, business, and our shared human experience.

Production Team
Arvind Ravishunkar, Ankit Pandey, Chandan Jha

Guest

Shikhar Ghosh

Professor at Harvard Business School

Shikhar Ghosh is a Professor at Harvard Business School, where he has taught several courses on entrepreneurship. He has been a successful entrepreneur for the last 20 years and has been the founder and CEO or Chairman of eight technology-based entrepreneurial companies.

Host

Arvind Ravishunkar

GM, Wipro, lab45 think tank, Strategist & Berkeley MBA

Go beyond the Unpacked AI: Explore references

Replika: Embodying AI

InstaDeep: AI Innovation Born in Africa

Latest podcasts

Uncovering GenAI tools and infrastructure with Rajat Monga, Co-Founder, TensorFlow

How to deploy AI sustainably with Dr. Eng Lim Goh, SVP at HPE

What you should know about LLM’s with Anupam Datta, Co-founder TruEra, and ex-CMU

Building prototypes and pilots using generative AI with Mark Donavon, Nestlé Purina

Are LLMs the Answer to Everything with Prof. Mausam, IIT Delhi

How AI will impact your business with Harvard Professor, Shikhar Ghosh

Develop GenAI Strategy for your organization with AI Scientist, Omid Bakhshandeh