Episode 4 | 53 Min | March 13

Performance and choice of LLMs with Nick Brady, Microsoft

Share on

Engaging topics at a glance

00:12:23
Introduction
00:14:20
Current use cases being deployed for GenAI
00:19:10
Performance of LLM models
00:36:15
Domain Specific LLMs vs General Intelligence LLMs
00:38:37
How to choose the right LLM?
00:41:27
Open Source vs Closed Source
00:44:50
Cost of LLM
00:46:10
Conclusion

"Exploring what should organization considering when choosing to adopt LLMs" with guest Nick Brady, Senior Program Manager at Microsoft Azure Open AI Service

AI has been at the forefront of transformation for more than a decade now. Still, the Open AI launch of chat GPT in November 2022 will be noted as a historical moment – the scale of which even Open AI did not expect – in the history of technological innovations. Most people don’t realize or fully appreciate the magnitude of the shift that we’re in. Now, we’re able to directly express to a machine a problem that we need to have solved; equipping these technologies with the right reasoning engines and the right connectivity could bring the biggest technology leapfrog not just for enterprises but even in everyday lives.

The onset of leapfrog does bring out a few questions for enterprises looking to adopt GenAI as a part of their strategy, operations and way ahead, like:

if we equip these technologies with the right reasoning engines and the right connectivity to this work and the things that we do every single day, this could be the single most democratising force of technology like the world has ever seen, not just in enterprise but even in our personal lives.rn

– Nick Brady

What use cases are best suited to adopt the models?

While most customers are looking for how this could reduce business costs in their organizations, the true value is when it is used to maximize business value productivity and downstream that could lead to employee satisfaction and customer satisfaction. Any place where there’s language – programming or natural language – is a good use case for generative AI, and that probably would be the most profound shift. So, if you have language, if you have a document, if you have big data where you’re trying to sort of synthesize, understand what that content and what the content is, generative AI models can do this ad nauseam without any delay.

The interesting thing about parameters is it’s actually not a direct correlation to how powerful the model might be. I mean, the parameter size refers to essentially the number of values that the model can change independently as it learns from data.

– Nick Brady

The most common metric used across the world to describe LLMs is the number of parameters; in the case of GPT 3, it is trained on 175 billion parameters, but what does this mean?

Parameter size refers to essentially the number of values that the model can change independently as it learns from data and stores all information in the vast associative ray of memory as its model weights. What’s perhaps more important for these models, and it speaks to more of their capability, is their vocabulary size.

How does one decide and evaluate which would be the best-suited model for the selected use cases?

The best practice really is to start with the most powerful and advanced language model like GPT 4.0 to test, if it’s even possible, with your use case. Post confirming the possibility of use case trickle down to simpler models to find its efficacy and efficiency. If the simpler model can probably achieve 90% of the way, with just a little bit of prompt engineering, then you could optimize for costs.

But many of LLMs are English only and that’s a real problem, especially for multinational enterprises and organisations that have diverse employees and diverse customers that speak in many different languages.

– Nick Brady

Organizations would have to define what quality means to them. It could be the model’s output, its core response, or performance in terms of latency, where the quality of the output may not be as important as how quickly we can respond back to the user.

The key for leaders is to pay close attention to the potential use cases, test them with the best model and then optimize the model to balance the cost, efficacy and efficiency factors.

Production Team
Arvind Ravishunkar, Ankit Pandey, Chandan Jha

Guest

Nick Brady

Senior Program Manager at Microsoft Azure Open AI Service

Nick Brady is a Senior Program Manager at Microsoft Azure Open AI Service. He is deeply passionate in his purpose to convince others the transformative impact artificial intelligence will have in improving their lives and the organizations they represent. He completed his Bachelor's in Organization leadership and Supervision and master’s in technology from Purdue University's Knoy School of Technology.

Host

Arvind Ravishunkar

GM, Wipro, lab45 think tank, Strategist & Berkeley MBA

Our podcast on your favourite platforms. Explore now!

Episode 12 | 56 Min | March 13

Uncovering GenAI tools and infrastructure with Rajat Monga, Co-Founder, TensorFlow

Share on

Engaging topics at a glance

00:16:10
Google Brain and TensorFlow
00:19:10
TensorFlow for AI world
00:21:55
Tooling and infrastructure needs of previous AI models vs. GenAI
00:25:40
Trade-offs, Open-source library and framework vs. private company's framework
00:31:30
Present quality of tool and Infrastructure available for GenAI
00:32:55
How do I build a GenAI team?
00:37:50
Vendors and the Infrastructure available today to take GenAI into production
00:41:51
How to differentiate in the GenAI world?

Join us in exploring the evolving space of GenAI tools and infrastructure, featuring Rajat Monga, Co-Founder of TensorFlow and Google Brain.

A good power tool can make the difference of easily six to seven hours of work when you're doing woodworking. And the world of AI is no different. The types of tools and infrastructure that are developed to help you as enterprise leaders build artificial intelligence products and features are very, very important. In this episode, with our guest's help, we will unpack the infrastructure that surrounds you and the tooling that will help you as enterprise leaders build AI products and services.

We will look at how tooling and infrastructure needs are changing in the world of AI with the increasing adoption of GenAI. One key change that emerged from the talk is that now things have evolved to the extent that we don’t need to train the models from scratch. We already have foundation models available that know our world up to some level, reducing the burden of training the model with tonnes of data. However, models today have become so large that they sometimes don’t fit in a single machine.

As connecting your database with these models is very important, we also discussed the trade-offs between the open-source and private libraries. Should companies manage data on their own or outsource it? When you are not training your model, the easiest and fastest way is to use API, and if you want your data on-prem, then it will mostly cost you more. In the end, it boils down to what core to you is, and often, not all part of the infrastructure is core to companies. So, if your data is not your core strength, then better outsource it.

This episode also uncovered the current tools and infrastructure available for GenAI. The current tools and infrastructure available for large-level deployments are going through a rapid evolution; they are not very hard to rebuild or replace, and new companies are emerging in the tooling and infrastructure for GenAI space.

When looking at the talent and skills needed for GenAI implementation in your organization, it is important to have technically sound people with domain expertise in the organization's particular area.

For the differentiation in the market domain knowledge in your area, relationship with the customers, distribution channel, your execution, etc., today plays an even bigger role. However, in this data-driven world, having proprietary data and knowing how to leverage it can be an added advantage. To find out more, tune in to the full podcast.

Production Team
Arvind Ravishunkar, Ankit Pandey, Chandan Jha

Guest

Rajat Monga

Co-Founded and was the Ex-Head of TensorFlow

Rajat Monga Co-Founded and was the Ex-Head of TensorFlow. He also Co-Founded Google Brain division and Inference.io. He is currently working in different areas of Artificial Intelligence and is an expert in the area of AI tools and infrastructure. He studied Electrical Engineering at IIT Delhi.

Host

Arvind Ravishunkar

GM, Wipro, lab45 think tank, Strategist & Berkeley MBA

Our podcast on your favourite platforms. Explore now!

Episode 10 | 61 Min | March 13

What you should know about LLM’s with Anupam Datta, Co-founder TruEra, and ex-CMU

Share on

Engaging topics at a glance

00:09:15
Introduction
00:13:40
What is a Large Language Model (LLM)?
00:18:40
Is LLM a form of intelligence?
00:20:25
Comparing how LLMs learn than human learning.
00:22:50
How LLMs differ from one another?
00:27:56
What to consider when choosing LLMs?
00:44:05
Can LLMs retrieve past human knowledge?
00:51:45
How can companies harness power of statistical models?
00:53:05
Key things to keep in Mind when integrating LLM into the business.
00:56:10
Conclusion

Join us in this episode featuring Anupam Datta, Co-founder and Chief Scientist, TruEra, as we dive into the evolution of LLMs and what they hold for the future!

This world of generative AI has caught us by storm. And as enterprise leaders in your companies, understanding the technology behind generative AI will give you a competitive advantage as you plan your companies and businesses. And to help you do this, we will unpack a technology, large language models (LLMs), that powers AI today and represent a paradigm shift in this field of Artificial Intelligence.

LLMs can craft meaningful responses across many domains. Their performance has notably improved recently thanks to the substantial increase in model size and data volume.

With the increasing acceptance of this technology, numerous companies are unveiling various Large Language Models (LLMs). It's important to recognize that opting for the largest or highest-performing LLM isn't always the most suitable approach. Instead, one might prefer LLMs that excel in specific tasks relevant to their application. As a leader in the enterprise, it's crucial to integrate this understanding into your company's strategy, aiding in identifying the appropriate LLMs to match and adapt for your applications. Achieving equilibrium between LLM selection, cost considerations, and latency considerations stands as a pivotal concern for enterprises. Equally essential is the thorough validation and assessment of generative outputs, serving as a safeguard prior to embarking on consequential choices. Hence, the undertaking of reliability testing at this current juncture is paramount.

Furthermore, enterprises need to consider a few other key aspects in this evolving landscape of LLMs as they build out LLMs. Starting with a well-defined business use case that offers real value is crucial. As LLMs move from development to production, it's important to establish thorough evaluations and observability throughout their lifecycle. Education across the organization is vital to implement LLMs effectively. Companies should train their workforce to adapt to this changing technology stack. Fostering a community around responsible AI development and evaluation can contribute to a better understanding and management of LLMs. With these steps, enterprises can navigate the complexities of LLMs and harness their potential for positive impact.

Production Team
Arvind Ravishunkar, Ankit Pandey, Chandan Jha

Guest

Anupam Datta

Co-founder TruEra, and ex-CMU

Anupam Datta is Co-Founder, President and Chief Scientist at TruEra. Datta was on the faculty at Carnegie Mellon University for 15 years, most recently as Professor and Director of the Accountable Systems Lab. He has authored more than 100 papers and has been cited over 9000 times.

Host

Arvind Ravishunkar

GM, Wipro, lab45 think tank, Strategist & Berkeley MBA

Go beyond the Unpacked AI: Explore references

Anupam Datta, TruEra

Learn More

Attention Is All You Need

Learn More

Performance and choice of LLMs with Nick Brady, Microsoft

Our podcast on your favourite platforms. Explore now!

Latest podcasts

Uncovering GenAI tools and infrastructure with Rajat Monga, Co-Founder, TensorFlow

How to deploy AI sustainably with Dr. Eng Lim Goh, SVP at HPE

What you should know about LLM’s with Anupam Datta, Co-founder TruEra, and ex-CMU

Building prototypes and pilots using generative AI with Mark Donavon, Nestlé Purina

Are LLMs the Answer to Everything with Prof. Mausam, IIT Delhi

How AI will impact your business with Harvard Professor, Shikhar Ghosh

Develop GenAI Strategy for your organization with AI Scientist, Omid Bakhshandeh

Exploring reinforcement learning with MIT Professor Vivek Farias

Uncovering GenAI tools and infrastructure with Rajat Monga, Co-Founder, TensorFlow

Uncovering GenAI tools and infrastructure with Rajat Monga, Co-Founder, TensorFlow

Our podcast on your favourite platforms. Explore now!

Top trending insights

How to deploy AI sustainably with Dr. Eng Lim Goh, SVP at HPE

Hey AI, welcome to the team: Emergence of algorithmic enterprise

The platform ecosystem business model: A blueprint for enterprises. Part 2

Unpacked with Arvind Ravishunkar

The platform ecosystem business model: A blueprint for enterprises. Part 1

GenAI revolutionizing banking: Opportunities and challenges

Healthcare trends: Disruptions and innovation

How AI will impact your business with Harvard Professor, Shikhar Ghosh

What you should know about LLM’s with Anupam Datta, Co-founder TruEra, and ex-CMU

What you should know about LLM’s with Anupam Datta, Co-founder TruEra, and ex-CMU

Go beyond the Unpacked AI: Explore references

Our podcast on your favourite platforms. Explore now!

Co-create for collective wisdom

Our podcast on your favourite platforms. Explore now!

Latest podcasts

Uncovering GenAI tools and infrastructure with Rajat Monga, Co-Founder, TensorFlow

How to deploy AI sustainably with Dr. Eng Lim Goh, SVP at HPE

What you should know about LLM’s with Anupam Datta, Co-founder TruEra, and ex-CMU

Building prototypes and pilots using generative AI with Mark Donavon, Nestlé Purina

Are LLMs the Answer to Everything with Prof. Mausam, IIT Delhi

How AI will impact your business with Harvard Professor, Shikhar Ghosh

Develop GenAI Strategy for your organization with AI Scientist, Omid Bakhshandeh

Exploring reinforcement learning with MIT Professor Vivek Farias

Uncovering GenAI tools and infrastructure with Rajat Monga, Co-Founder, TensorFlow

Our podcast on your favourite platforms. Explore now!

Top trending insights

How to deploy AI sustainably with Dr. Eng Lim Goh, SVP at HPE

Hey AI, welcome to the team: Emergence of algorithmic enterprise

The platform ecosystem business model: A blueprint for enterprises. Part 2

Unpacked with Arvind Ravishunkar

The platform ecosystem business model: A blueprint for enterprises. Part 1

GenAI revolutionizing banking: Opportunities and challenges

Healthcare trends: Disruptions and innovation

How AI will impact your business with Harvard Professor, Shikhar Ghosh

What you should know about LLM’s with Anupam Datta, Co-founder TruEra, and ex-CMU

Go beyond the Unpacked AI: Explore references

Our podcast on your favourite platforms. Explore now!

Co-create for collective wisdom

Wait a moment!

Generative AI on the cusp of disruption

Are LLMs the Answer to Everything with Prof. Mausam, IIT Delhi

What you should know about LLM’s with Anupam Datta, Co-founder TruEra, and ex-CMU

What will it mean to be a “Human Being” in 2100?

Join Think Tank’s global launch broadcast on April 30

Guy Kawasaki Subha Tatavarti

Unlock exclusive insights!

Guy Kawasaki
Subha Tatavarti