
Episode 24

The Evolution of GenAI: From GANs to Multi-Agent Systems

Martin Musiol

Episode Summary

Early Interest in Generative AI

  • Martin’s initial exposure to Generative AI in 2016 through a conference talk in Milano, Italy, and his early work with Generative Adversarial Networks (GANs).

Development of GANs and Early Language Models since 2016

  • The evolution of Generative AI from visual content generation to text generation with models like Google’s BERT, and the increasing popularity of GANs in 2018.

Launch of GenerativeAI.net and Online Course

  • Martin’s creation of GenerativeAI.net and an online course, which gained traction after being promoted on platforms like Reddit and Hacker News.

Defining Generative AI

  • Martin’s explanation of Generative AI as a technology focused on generating content, contrasting it with Discriminative AI, which focuses on classification and selection.

Evolution of GenAI Technologies

  • The shift from LSTM models to Transformer models, highlighting key developments like the “Attention Is All You Need” paper and the impact of Transformer architecture on language models.

Impact of Computing Power on GenAI

  • The role of increasing computing power and larger datasets in improving the capabilities of Generative AI.

Generative AI in Business Applications

  • Martin’s insights into the real-world applications of GenAI, including customer service automation, marketing, and software development.

Retrieval Augmented Generation (RAG) Architecture

  • The use of RAG architecture in enterprise AI applications, where documents are chunked and queried to provide accurate and relevant responses using large language models.

Technological Drivers of GenAI

  • The advancements in chip design, including Nvidia’s focus on GPU improvements and the emergence of new processing unit architectures like the LPU.

Small vs. Large Language Models

  • A comparison between small and large language models, discussing their relative efficiency, cost, and performance, especially in specific use cases.

Challenges in Implementing GenAI Systems

  • Common challenges faced in deploying GenAI systems, including the costs associated with training and fine-tuning large language models and the importance of clean data.

Measuring GenAI Performance

  • Martin’s explanation of the complexities in measuring the performance of GenAI systems, including the use of the Hallucination Leaderboard for evaluating language models.

Emerging Trends in GenAI

  • Discussion of future trends such as the rise of multi-agent frameworks, the potential for AI-driven humanoid robots, and the path towards Artificial General Intelligence (AGI).

Resources

Martin Musiol LinkedIn – Martin Musiol | LinkedIn

Generative AI – https://generativeai.net/

Martin’s book – Generative AI: Navigating the Course to the AGI Future

Data & AI Magazine – Media – Data Science Talent

Transcript

Speaker Key:

DD Damien Deighan

PD Philipp Diesinger

MM Martin Musiol

00:00:00

DD: This is the Data Science Conversations Podcast with Damien Deighan and Dr. Philipp Diesinger. We feature cutting edge data science and AI research from the world’s leading academic minds and industry practitioners so you can expand your knowledge and grow your career. This podcast is sponsored by Data Science Talent, the data science recruitment experts. Welcome to the Data Science Conversations podcast. My name is Damien Deighan, and I’m here with my co-host, Philipp Diesinger. How’s it going, Philipp?

PD: It’s going well, Damien. Thanks. Pleasure to be here.

DD: Today we’re talking to Martin Musiol about how to build successful GenAI products in the business world. By way of quick introduction, Martin’s academic background is in engineering and computer science, and he’s been coding since 2006. His professional experience includes working for some of the world’s leading companies, including household names such as IBM, Airbus and Mphasis. He also got involved in the startup world and in the past had his own NLP startup. And of course, he has been working in the field of Generative AI since 2016, which is an absolute lifetime in this

 

00:01:22

discipline. He’s also the creator of the world’s first online course in Generative AI, and his book on the subject was published by Wiley.

He’s also the organizer of the Python Machine Learning Meetup in Munich, and the creator of the influential newsletter Generative AI – Short & Sweet, which now has over 40,000 subscribers. So, I can safely say that we have with us today one of the preeminent practitioners in the field of Generative AI. Martin, we can’t wait to talk to you. Thanks for joining us. And how are you doing?

MM: Thank you so much for having me. I’m looking forward to the conversation.

DD: So, if we start back in 2016, that was long before anyone was talking about Generative AI, what was it that made you get interested all the way back in 2016?

MM: So, in 2016 I had a conference talk in Milano, Italy, which was titled Generative AI: How the New Milestones in AI Improved the Products and Services we Built. I was at that time at Flock Design, a design consultancy. That was my first job; I was a data scientist just coming out of university. And I had stumbled upon a paper from 2014, actually, Generative Adversarial Networks. That was the vanilla GAN by Ian Goodfellow. It was in the back of my head, and I thought a lot about it.

And in that design world I was exposed to, things were popping up, like the first results of GANs, yeah, actual visual results. And I saw, hey, you can actually generate images with it. At that time, the images were

00:02:56

very bad, but it was clear to me that at some point, though the timeframe was unclear, this would become indistinguishable from real humans or real animals, or whatever is being generated. So, I started taking this and talking about it. And I also took the concept of 3D object generation and a couple of other fields, and talked about that in the design context.

DD: So, very much in the right place at the right time, I guess, given that was one of the early use cases. What happened after those first couple of years, and when did you really start to see this technology come of age?

MM: Talking about Generative AI created a local buzz at that specific conference, it was a data-driven innovation conference. It was really interesting, there were engaging conversations, but then it faded out because there was no proper business value behind it yet. Also, it was not about text generation, it was mostly about visual content being generated, because language models were not that good at the time. In 2017, the first really impressive language model came out, BERT by Google; at that time I was at IBM. We actually used that BERT model to implement a specific use case for a client in the geological context. In 2018, I saw that there was some traction coming up in the papers, more and more GAN papers being submitted, and lots of different use cases also on the language model side.

So, I decided to, A, build the generativeai.net webpage, and B, build an online course. Yeah. I teased it on Reddit back then and on Hacker News, and it

00:04:43

went a bit viral. Not too much, but a bit beyond my expectations. And so, I had a large email list, I decided, okay, yeah, there is actually interest in it, and I went for it. A lot of things have happened, yeah, but I think it really got pushed into mainstream attention, the larger-scale attention on Generative AI, when I saw the first bump in my webpage analytics. The first bump I saw was for DALL-E, in the summer of 2022, before ChatGPT came out. There was already an exponential bump on my webpage, generativeai.net. And then with ChatGPT at the end of the same year, it was again much more exponential, and many things have happened since then. Yeah.

DD: So, you got there even before Coursera, on the GenAI online courses?

MM: Yeah, [laughter]. To my knowledge, yes.

PD: Great. So, Martin, in your view, how would you define the term Generative AI? What does it mean? I think it’s something that has changed a lot over time, right? It has gone through an evolution, like you already mentioned. But in your perspective, what are the defining factors of Generative AI?

MM: There are many different perceptions. How I would describe it is: there are models that can discriminate between different options or selections. This is more the traditional part of AI and machine learning, where we have classification, regression, reinforcement learning is also part of it, dimensionality reduction. So, there is a whole bunch of different Discriminative AI use cases. And what is driving AI now is the Generative AI models, where we actually generate something. We generate text from

00:06:42

scratch, images, videos, if you have images in sequence, 3D objects, yeah, and so much more.

I think it’s also interesting to see that when we want to judge someone, judging is easy, but actually having a certain skill is really hard. In the same way we can look at Discriminative AI, which early on was like judging between different options. That’s relatively easy compared to actually generating data, sequential or parallel data. So, that’s why I think there was this shift from when Discriminative AI was more dominant in 2014, 2016. Of course, it still has use cases, recommendation engines for instance, and so forth. Yeah. They drove so much revenue globally, and now it has shifted to Generative AI. And I think Generative AI is not even close to its potential, what it can do.

PD: So, if I hear you correctly, you’re basically saying it’s a new type of AI capability that focuses on generating some form of content or information or so.

MM: Correct. Yeah.

PD: And we already talked about the fact that you have been part of this journey from the very beginning. From your perspective, could you take us through the evolution of GenAI that you have been witnessing? We already mentioned some of the key moments, like Transformers and attention. You mentioned that in 2017 there was a famous paper called Attention Is All You Need. How did these key events shape the evolution of GenAI? And how did you experience it?

00:08:19

MM: In the evolution, I think I would distinguish between sequential data generation, so text, music, sound, code, etcetera, and parallel data generation, so images. On the sequential data generation side, before the Transformer models, LSTMs were quite good, or sequence-to-sequence models. They were good, and we also used them for different kinds of projects. I actually saw this shift in a real project, when this paper came out, and the BERT architecture, I think, right after that. Or it was even released at the same time, I’m not really sure. And we used that BERT, which is a Transformer, in a very early version. We needed to identify specific words, and initially we used regex, which worked to like 80% accuracy.

Then we used an LSTM, which was maybe 90% good. And then with the Transformer, we got to really 99% good. And that showed me, oh, okay, this is very interesting: a completely new architecture, and we are already so much better than existing models. Interestingly, it was Google that came up with the Transformer. If they had doubled down on that architecture early on, they could have had their ChatGPT moment a year or two later. That didn’t happen, so they open sourced it and OpenAI did it. And over time, this is what Ilya Sutskever, the co-founder of OpenAI, has said. I listened to an interview with Geoffrey Hinton where he was saying that Ilya said this and he didn’t believe it in the beginning.

But now he’s also convinced that the more computing power and the more data you throw at these models and the larger you make them, the better the 

00:10:18

capability. So, this rule roughly works, and so far, I think, it is quite true. And this is basically what happened then: the language models got bigger and bigger. And then at some point, this is what made ChatGPT. ChatGPT also added reinforcement learning from human feedback, where they shaped it into a conversational language model and improved it further so that it really brings value from conversation one. So, that’s the sequential data side, and now they also merge, they can also generate images. So, functionalities are being merged there. Yeah, that’s on that side.

And maybe just briefly on the parallel data generation side: for image generation, GANs were very powerful up until a certain time. And then when Stability AI open sourced their Stable Diffusion model, I think that changed things quite drastically, in that image generation became accessible to everyone. You could download the Stable Diffusion model and generate your own images. It’s basically an algorithm that, in the training process, stepwise introduces noise until the image is very noisy, and then tries to learn the denoising steps until it can actually generate. And it’s conditioned, via CLIP, on a certain text, which is basically the prompt, and it reconstructs an image based on that prompt. This is how it trains, and then in production you can use it with just prompting and it generates the image.
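
The denoise-and-prompt loop described here can be tried in a few lines. A minimal sketch, assuming the open-source Hugging Face diffusers library and a public Stable Diffusion checkpoint (neither is something Martin specifies), might look like this:

```python
# Illustrative sketch of prompt-to-image generation with an open Stable Diffusion
# checkpoint via the `diffusers` library. Model ID and settings are assumptions.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # any open SD checkpoint works here
    torch_dtype=torch.float16,
)
pipe = pipe.to("cuda")  # assumes a GPU; use .to("cpu") with float32 otherwise (much slower)

# The text prompt conditions every denoising step described above.
image = pipe(
    "a container ship entering a foggy harbour at dawn, photorealistic",
    num_inference_steps=30,
    guidance_scale=7.5,
).images[0]

image.save("generated.png")
```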

PD: And in your view, what are some of the key technologies that have been driving GenAI and are currently driving the advancement in GenAI?

 

00:12:06

MM: So, up until last year, I was the Generative AI lead for Infosys in EMEA. And roughly 99% of the client asks for AI projects included something like: we want to query against a knowledge base. We have implemented a couple of those projects; now I’m on my own, not associated with Infosys anymore. But the core of these architectures is basically a Retrieval Augmented Generation architecture, RAG for short. And what you do there is you have these large corpora of knowledge.

You take the documents and chunk them into sizable chunks, and then there is the query that you put in. Let’s say, one we have built with a global transport corporation that ships containers from A to B across the globe. Their clients write lots of emails asking, what are the regulations in this country? What are the regulations in that other country? What are the overarching rules? And to answer these emails, someone has to first understand them, then go through all of the documentation, take the respective part of it, write it all up together and send it out. So, this can take hours.

And what we have done is, we built a chatbot where they copy and paste in the email. The email’s intention is extracted, and this intention is then queried against a vector database, where we have all of these chunks I mentioned before, and we retrieve the most relevant chunks. That’s semantic search, basically. And these chunks are then taken into a prompt engineering part, or a prompting template, and sent

00:14:00

or posted to the language model. At that time it was GPT-4, now we have GPT-4o. This then answers very clearly what the intention of that email was. So, RAG architectures are driving most of the applications.
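
A minimal sketch of the RAG flow described here, chunk, embed, retrieve, prompt, might look like the following. The library choices (sentence-transformers for embeddings, the OpenAI client for the language model) and the model names are illustrative assumptions, not the stack used in the project Martin mentions:

```python
# Hedged sketch of a RAG pipeline: chunk documents, embed them, retrieve the most
# relevant chunks for an incoming email, and let an LLM draft the answer.
import numpy as np
from sentence_transformers import SentenceTransformer
from openai import OpenAI

embedder = SentenceTransformer("all-MiniLM-L6-v2")
client = OpenAI()  # expects OPENAI_API_KEY in the environment

def chunk(text: str, size: int = 800) -> list[str]:
    """Split a document into fixed-size character chunks (real systems chunk smarter)."""
    return [text[i:i + size] for i in range(0, len(text), size)]

# 1. Index the knowledge base once.
documents = ["... shipping regulations for country A ...",
             "... customs paperwork rules for country B ..."]
chunks = [c for doc in documents for c in chunk(doc)]
chunk_vectors = embedder.encode(chunks, normalize_embeddings=True)

# 2. At query time: embed the incoming email (its extracted intention).
email = "Hi, which documents do we need to ship refrigerated containers to country B?"
query_vector = embedder.encode([email], normalize_embeddings=True)[0]

# 3. Semantic search: cosine similarity against every chunk (a vector DB does this at scale).
scores = chunk_vectors @ query_vector
top_chunks = [chunks[i] for i in np.argsort(scores)[::-1][:3]]

# 4. Put the retrieved context into a prompt template and ask the LLM.
prompt = ("Answer the customer email using only the context below.\n\n"
          "Context:\n" + "\n---\n".join(top_chunks) + f"\n\nEmail:\n{email}")
reply = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": prompt}],
)
print(reply.choices[0].message.content)
```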

PD: So, you’re already talking a lot about use cases, especially with embedding, vectorization and chunking of data, that’s a very powerful use case. Coming back a little bit to the question about technologies: in the last eight years that you’ve been experiencing GenAI, did you see any major changes driven by technology? Be it chip design, computational power, GPUs, any of that? Did you have any eureka moments where suddenly something changed purely based on new technologies being available?

MM: Not really, but there is a lot happening in chip design; Nvidia is of course focusing strongly on it. Basically, all of the training investment, and all of these investments into startups and other companies and projects, flows down through the hyperscalers to Nvidia. So yeah, there is a lot happening at Nvidia on the one hand, where they are improving their chips. Then we also have new processing unit architectures, such as the LPU, the Language Processing Unit. Groq is a company that challenges Nvidia with that, though not really on the same level. I haven’t fully understood how they do it, but they have lower latencies. So, yeah, on the processing side, a lot is happening.

 

00:15:47

PD: Yeah. And you already mentioned vectorization and embedding of massive amounts of documents; I think that’s a very important corporate, large-enterprise use case at the moment, especially, as you mentioned, in the context of regulatory issues or problems. Are there any other real-world applications that you see taking off already?

MM: Yeah. There are lots of tools in marketing. Maybe one step back: roughly a year ago, there was a survey from McKinsey where they mapped business functions against industries, and they showed which business function and which industry has what potential of being disrupted by GenAI. Across industries, it was marketing, which makes sense. Marketing was number one, but there I see more of a disruption through SaaS products, tools for writing fast, or Grammarly, this and that. But where I see a lot of real enterprise applications being developed is customer success or customer service. So, where we have lots of emails coming in, for instance an email saying, hey, I have changed my address. You could write that via email, and today a human reads it and writes it into the database.

But with the current AI that we have, current language models, you can extract that information and check it against databases, or even against other language models. We have also built a system where language models are checking each other and voting, and then writing the result automatically into a database. This even works when customers send a screenshot of their new address or their bank account, which happens in customer service quite a lot. This is my new address, this is my new bank account, this is my contract, this is how much I consume. And

00:17:44

all of these interactions can be mostly automated. I would even go so far as to say that calls can be automated to a degree. If you call, I don’t know, your internet provider and say, hey, I just want to let you know I have changed my address, you can say this to an AI and the AI takes it on.
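
The extract-and-vote pattern Martin mentions for customer-service emails can be sketched roughly as follows. The model names, the JSON schema and the majority-vote rule are assumptions for illustration, not his team’s implementation:

```python
# Hedged sketch: several LLM passes extract the new address as JSON, and we only
# write to the database when a majority of the answers agree.
import json
from collections import Counter
from openai import OpenAI

client = OpenAI()

EXTRACTION_PROMPT = (
    "Extract the customer's new postal address from the email below. "
    'Reply with JSON only, e.g. {"street": "...", "city": "...", "postcode": "..."}.\n\n'
)

def extract_address(email: str, model: str) -> dict:
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": EXTRACTION_PROMPT + email}],
        response_format={"type": "json_object"},
    )
    return json.loads(response.choices[0].message.content)

def extract_with_voting(email: str, models=("gpt-4o", "gpt-4o-mini", "gpt-4o")) -> dict | None:
    """Run several extractions and accept the answer only if a majority agree."""
    answers = [json.dumps(extract_address(email, m), sort_keys=True) for m in models]
    winner, votes = Counter(answers).most_common(1)[0]
    return json.loads(winner) if votes >= 2 else None  # None -> route to a human

email = "Hello, please note I moved. New address: 12 Harbour Lane, 80331 Munich."
address = extract_with_voting(email)
if address:
    print("Write to CRM:", address)   # in production this would be a database update
else:
    print("Models disagree; escalate to a human agent.")
```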

PD: You already mentioned, taking a step back, that GenAI will affect many different industries, I think that’s a general expectation. And you already mentioned functions like marketing; are there any other insights you have on this? Where would you expect the biggest impact to be, in industries like healthcare, finance, entertainment, whatever it is?

MM: Yeah. Software development has a huge potential for AI disruption. I’m developing an app where you can drop in 10-Ks and 10-Qs, the financial statement filings. It extracts the company hierarchy of a corporation, with all of its subsidiaries, basically from all of the documents that you drop in, and then it draws the hierarchy. I started this project this morning and it’s at 90% complete today. What I need for this is the front-end part, some backend part, communication to a language model, and also a RAG system.

Of course, we’re not speaking about an enterprise-grade, perfectly SOC 2 compliant SaaS product, but the POC phase. And how did I do it? I used Claude 3.5 Sonnet, where you can build your own projects. With natural language only, I described what I want to develop. I thought clearly about what I want to have and what steps are needed to make this product actually functional, described

00:19:37

it well, a couple of iterations, then there were errors and I had to copy and paste them back. But it’s 80-90% good; as a POC it works. And this is the weakest model we will have going forward. Honestly, I think learning Python and C++ will not be a must for software developers in the future.

PD: It hurts to hear that, [laughter]. But I can see that it might be a reality, yeah.

DD: I would like to take a brief moment to tell you about our quarterly industry magazine called The Data Scientist, and how you can get a complimentary subscription. My co-host on the podcast, Philipp Diesinger, is a regular contributor, and the magazine is packed full of features from some of the industry’s leading data and AI practitioners. We have articles spanning deep technical topics from across the data science and machine learning spectrum. Plus, there’s careers advice and industry case studies from many of the world’s leading companies. So, go to datasciencetalent.co.uk/media to get your complimentary magazine subscription. And now we head back to the conversation.

PD: Would you have any advice for larger corporations in terms of dealing with or integrating commodity AI into their AI strategy?

MM: Yeah. That’s actually a very good point, because it doesn’t have to be a $1 million project. I was in consulting basically my whole professional life, and right now I have my own little consultancy. We are building products and we talk with a lot of clients. So, I always advise to start, if you are not already in it. Start lean; you need a little bit of technical expertise, or you can work with a consultancy on a small POC to get started.

00:21:38

And I’m disregarding for now that you might also want to think about the role of your cloud strategy. But even with a cloud, you can get started quickly.

You just need a credit card, and you can set up a couple of services and get started with a POC, and that’s not even expensive. There are various sources where you can get a RAG system with a vanilla architecture where you put your documents in. Even with Azure, for instance, there is a service, Azure AI Search I think it’s called, where you just place documents and they give you a very sophisticated, but on the front end very simple, semantic search that is very powerful. And you can basically query all kinds of internal documents.

And in Azure, or any other platform like AWS, you can build it in professional ways with your documents. Especially in Europe, there’s a bit of a tendency to be afraid that the documents will be used for training models, or that they somehow get leaked to competitors or other companies in these professional hyperscaler setups. This is not really the case once you read more deeply into how to set it up. And you can use solutions almost out of the box to get RAG systems going, where you have your documents and then with a language model you answer questions against a certain knowledge base, or knowledge asset. That definitely works. And of course, there are also proper SaaS products that can be used.

 

00:23:26

DD: Can you give us an overview, Martin, of small language models and how they compare to large language models and their deployment in the enterprise?

MM: Yeah. So, small language models are models that have a significantly smaller number of trainable parameters. We know that GPT-3.5, for instance, had 175 billion trainable parameters. A small language model such as Phi-3 from Microsoft has about 3 billion trainable parameters. Compared to the early times in the space, this is still very large, because I think BERT had like 100 million or something, 89 million, 98 million, something like this. So, Phi-3 is, compared to that, quite large. But in comparison to GPT-3.5, and I think we are still not fully sure how big GPT-4 is, or I missed that information, but it’s for sure two orders of magnitude larger than that. And the funny thing is that well-trained, or well-prompted, small language models are actually very comparable in performance, in specific, carefully chosen use cases, to large language models like GPT-4 or Claude 3 Opus.

What can also be seen there is the hallucination that happens a lot. I think this is not 100% verified, but for a long time there was the narrative that the larger a language model is, the more information it holds, because it needs more information to train on, but also the more likely it is to introduce certain hallucinations. I’m not 100% convinced of that. However, small language models are, A, very comparable; B, extremely fast, so when latency matters, small language models could be a great choice; and C, they are not as expensive, computationally, in running and

00:25:37

training as well. Also, most likely it’s even an open-source model that you can download and run on your own computer. Phi-3, via Ollama, you can download it on your laptop and chat with it right away; it works. And it depends, but I would say in 99% of the cases there is a smaller model that can be as good as a large model in a given use case.
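
Running a small model locally really is this simple. A quick sketch against Ollama’s local REST API, assuming Phi-3 has already been pulled with `ollama pull phi3` and the Ollama server is listening on its default port:

```python
# Minimal local chat with Phi-3 via Ollama's REST API (defaults assumed).
import requests

response = requests.post(
    "http://localhost:11434/api/chat",
    json={
        "model": "phi3",
        "messages": [{"role": "user", "content": "Summarise what a RAG system does in two sentences."}],
        "stream": False,  # return one complete response instead of a token stream
    },
    timeout=120,
)
print(response.json()["message"]["content"])
```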

PD: Martin, you mentioned earlier a couple of challenges in setting up GenAI systems. There are different dimensions to it, obviously: there is a whole data dimension, prompt engineering, setting up proper networks and processes of agents interacting with each other, and so on. From your perspective and your practical knowledge, what are the most common challenges that people face when they’re trying to get their hands dirty with GenAI systems, trying to set them up, maybe even training or fine-tuning them? What is your experience in that field?

MM: Yeah. When it comes to training, when we speak of training from scratch, it’s very costly. For large language models it can be many, many millions; for the frontier models, probably even beyond, or soon beyond, a billion. And when we talk about fine-tuning, this can also be very costly if it’s a large language model. There are open-source models like Llama 3, and I think very soon Llama 3 405B, so 405 billion trainable parameters. And if you want to just fine-tune that one, well, open source stands for democratization of this technology, but where’s the democratization? You already need extreme resources just for fine-tuning it. That’s less of a problem with small language models.

00:27:31

So, when it comes to fine-tuning, the size of the model is one problem. You also always want to check where a model comes from; you don’t want to just use any model, because there are actually more and more problems in terms of jailbreaks, or sleeper agents introduced into language models that we don’t know about. There are so many language models out there, but of course in mainstream media we hear about the OpenAI ones, Anthropic, Google and a couple of others. But if the source is not well known, all sorts of things can be introduced, for instance sleeper agents. Should I quickly elaborate on what sleeper agents are?

PD: Yeah, please do.

MM: So, sleeper agents, this is what Anthropic research has actually shown: in the training they introduced a character sequence, and whenever this character sequence is hit, this can be __something__, then it starts to insult or really disregard any guardrails and just write whatever. They have seen that no matter how hard you guardrail it or prompt it, if this sequence is introduced, there is some probability that it starts switching into this dark mode again. And you don’t want to have this in a professional setup of that language model, maybe even interacting with customers. So, you really want to make sure of the sources it comes from. And they have even seen that this behaviour still shows up after fine-tuning the models.

One thing in implementation, Philipp, that you also asked about: I had a client, this was the early times of GPT-4 Turbo, which was also quite expensive and used a lot with large messages.

00:29:23

and uses a lot with large messages. And we had a client that had one month, trial month. It has been used so often with our application that it, because it was publicly available, that it costs half a million on this one month, just for API costs. So, if you release it and you just let it there, and people can interact as much as they want. Each communication costs money, and so, you want to be aware in the design of that. Also, there are many bots on the internet, the internet is a very dirty place. Maybe you want to secure that with login only and stuff like this. So, lots of design thinking has to get in.

PD: Yeah. How about the quality or quantity of data, what role does that play? We already talked about small language models; reducing the amount of training data and so on can be vastly important. What’s your experience there?

MM: So, the first thing I would like to say is that if you work with a well-trained language model, via ChatGPT or with Claude 3, they are actually very capable of dealing with noisy data. And if you even raise the awareness of the language model that, hey, the data you will see from now on is quite noisy, so please just focus on the relevant parts of the text that provide content, and then maybe you also describe the kind of content, then it works quite well. So, that’s one thing. And also, prompt engineering is a science in itself. There are regularly papers coming out; it has slowed down a little bit now in terms of prompting papers, but there was a huge wave of that, I felt.

 

00:31:12

And I had a colleague who was literally splitting his day: half the day reading papers on prompt engineering, and the other half going from project to project to just do the prompt engineering. But when we speak about RAG applications and you want to query knowledge data, I would definitely make sure that we have clean data in there, especially when we are speaking about large amounts of data. Not every single character has to be right, it’s still good at dealing with noisy data, but getting the chapters that don’t carry relevant information out of that knowledge asset can be very helpful.
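
A minimal version of the “tell the model the data is noisy” instruction described here might look like the prompt template below; the wording is an illustrative assumption to adapt to your own documents:

```python
# Illustrative prompt template for querying noisy, automatically extracted context.
NOISY_CONTEXT_PROMPT = """\
The context below was extracted automatically from scanned documents,
so it contains OCR noise, broken tables and boilerplate headers.
Ignore the noise and answer using only the parts that carry real content
(regulations, dates, requirements). If the answer is not in the context,
say so instead of guessing.

Context:
{context}

Question:
{question}
"""

prompt = NOISY_CONTEXT_PROMPT.format(
    context="R3gulat1on 47-b: refr1gerated containers require ...   [PAGE 12 FOOTER]",
    question="What is required for refrigerated containers?",
)
# `prompt` would then be sent to the language model exactly as in the RAG sketch above.
```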

PD: And so, let’s talk a little about the performance of GenAI systems. What are good ways, or metrics, of actually measuring performance?

MM: You know, this is actually a complex topic, because there are these different benchmarks, MMLU, Q&A benchmarks and so forth. But it has also been shown that this information gets included in the training data, and there is not a clean separation between the training data and the testing data. So, some of the benchmarks are, I would say, questionable. But what works well is the comparison between the language models. This is where the Elo rating comes in; it’s originally from chess. And there is the Elo rating on the LMSYS leaderboard, hosted on Hugging Face. They have a great webpage that they update, I think, on a daily or every-other-day basis, and they show which model is performing best against the others.

 

00:32:51

So, they can create this ranking quite well. And last time I checked, GPT-4o was number one in general terms. Yeah. And there is also a second interesting board, called the Hallucination Leaderboard, also hosted on Hugging Face. It shows, according to different tests, what percentage of the time certain language models hallucinate, I must also say, without context information. So, unlike in RAG systems, where you provide context information from the documents, here that is not necessarily the case. Number one with the lowest hallucination rate was also GPT-4, and then a Salesforce one and so forth, so, interesting to look at. I would look at these two to get a first feeling for this space.
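
The Elo mechanism behind such arena-style leaderboards is simple to sketch: after each human-judged head-to-head, the winning model takes rating points from the loser. The K-factor and starting ratings below are conventional choices, not the exact values any particular leaderboard uses:

```python
# Sketch of the Elo update used for pairwise model comparisons.
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that model A beats model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))

def update_elo(rating_a: float, rating_b: float, a_won: bool, k: float = 32.0):
    ea = expected_score(rating_a, rating_b)
    sa = 1.0 if a_won else 0.0
    return rating_a + k * (sa - ea), rating_b + k * ((1 - sa) - (1 - ea))

ratings = {"model-x": 1000.0, "model-y": 1000.0}
# Suppose human raters prefer model-x in 3 of 4 pairwise battles:
for a_won in (True, True, False, True):
    ratings["model-x"], ratings["model-y"] = update_elo(ratings["model-x"], ratings["model-y"], a_won)
print(ratings)
```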

PD: And what are some misconceptions about GenAI that you would like to clarify? Or some common questions that you get a lot?

MM: Currently, I must say, for a couple of weeks now it has been Claude 3.5 Sonnet, and soon Claude 3.5 Opus, with all of the tools around it. So, they have Artifacts and Projects. Artifacts means it can code a lot and it writes the code right into documents. Within a project you can build these different documents, and you even have a publishing feature. So, when you have a web app that you are building, you can literally build it just with your thoughts, click on publish and you have a unique URL. This is quite crazy, I must say, that’s what’s happening there. And to everyone, I recommend trying it out. I have no association with Anthropic, but this is what really gets me going these days.

00:34:42

PD: So, we are already talking a little bit about the future of GenAI and the emerging trends and so on. Are there any other emerging trends in GenAI that you are excited about, or that data scientists and business leaders should watch out for?

MM: So, yeah, we see it already, and I also write about this in my book: the agentic future. We will have agents, with language models at the core, and then they have certain function calling, connections to APIs. Ideally even in a multi-agent framework, so we have multiple agents communicating with each other, with maybe a project manager agent on top, a quality assurance agent, a communication agent, and a developer agent, front-end, back-end, whatever. And these multi-agent frameworks, for instance crewAI is one multi-agent framework, they are super, super powerful.

Andrew Ng, in fact, says that we already have GPT-5-level performance: it’s GPT-4o in a multi-agent framework, that’s his reasoning. And I agree, together they’re stronger. And I believe that in the future, in the professional as well as the private space, we will walk around with a fleet of AI agents that will perform all kinds of things for us. They will reach out to other agents and to webpages and plan; they will act like our executive assistants. They will plan our trips, they will take over certain kinds of projects, like product development. I think the future is very bright in that regard. Yeah.
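
A small multi-agent setup of the kind described here can be sketched with CrewAI, one of the frameworks Martin names. The roles, goals and task wording are illustrative assumptions, and CrewAI calls a language model behind the scenes, so an API key is expected in the environment:

```python
# Hedged sketch of a two-agent crew: a developer agent drafts a plan, a manager reviews it.
from crewai import Agent, Task, Crew

manager = Agent(
    role="Project manager",
    goal="Check the developer's output for completeness and production-readiness",
    backstory="Keeps the other agents on scope and on deadline.",
)
developer = Agent(
    role="Developer",
    goal="Write a short technical plan for the requested feature",
    backstory="Pragmatic engineer who prefers simple architectures.",
)

plan_task = Task(
    description="Plan a chatbot that answers shipping-regulation emails using a RAG backend.",
    expected_output="A numbered implementation plan with the main components.",
    agent=developer,
)
review_task = Task(
    description="Review the developer's plan and list anything missing for a production rollout.",
    expected_output="A short review with concrete gaps.",
    agent=manager,
)

crew = Crew(agents=[manager, developer], tasks=[plan_task, review_task])
print(crew.kickoff())
```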

And agents will communicate with each other, my agents with your agents, to discuss a rate for a conference, I don’t know. It’s very exciting. And of course,

00:36:26

what we see, and this is maybe not the number one priority for current business leaders, is that 2024 is the year we have a lot of these technical convergences. 2024 is the year where humanoid robots are taking off. We see, of course, Elon Musk with his Optimus, and Figure collaborating with OpenAI, where they basically put GPT-4o, with its visual capabilities, into the head of their humanoid robot. And the same is happening in China, quite a lot. And also, having a physical embodiment of an AI model and being able to interact with the world is helpful on the path towards AGI.

DD: Can you elaborate on what you just said about AGI, Martin?

MM: Yes. So, the question is, what is the path forward for AI to get better, to maybe become this AGI, Artificial General Intelligence? And it has a lot to do with understanding the world. If we only understand the world via text, well, text is just an approximation of the real world. If I tell you, for instance, that I lived in the jungle for three months, and I explain many different scenarios of how it was to live in the jungle for three months, you have a good understanding of what it is, but you don’t really know what it is unless you yourself live three months in the jungle. So, that’s the approximation of text.

So, being able to close that gap from approximation to reality, that is, I think, key towards AGI. I elaborate on this more in my book. It is also the visual stream, the microphone stream, maybe also touch through the embodiment of a robot. This is interacting with the physical world. And all of

00:38:26

these different sensors: multimodal, multi-sensor, and multitasking is also what I’m saying, because our brains basically also multitask; we breathe at the same time, and we think at the same time while we do something. But multitasking, actually, there’s already a checkbox behind that. These three things are key to getting closer to it, on as many channels as possible.

DD: Super interesting. Yeah. I think we can do a whole podcast on the path to AGI. So, Martin, you mentioned your book, could you give us a quick overview of the book, please?

MM: Yeah. Very happy to do so. It’s basically split into three parts. A shorter part about the history of AI; actually, Generative AI elements already occurred in the past. For instance, in 1965, Joseph Weizenbaum developed a chatbot called ELIZA. You typed with it, and if you said, for instance, something about your family, it recognised, ah, family, how do you feel about your family? It worked like this. Then the second part is about the current state of AI, or Generative AI: what are the different fields, and especially, what is the untapped field?

There I’m guiding the reader through a framework for coming up with, or having an educated guess about, where to apply Generative AI from their perspective and build tools, products, ideas. And then the third part, which is probably like 35% of the book, is looking forward. There I’m going towards the future of Generative AI, the future of AI agents, so that’s part of Generative AI: autonomous AI agents, multi-agent frameworks and robots, yeah, humanoid

 

00:40:25

robots that will merge with that. And then towards Artificial General Intelligence, and how do we get there, and how to be prepared.

DD: Awesome. And we’ll put a link in the show notes, but presumably people can find it on your website, Amazon and the standard outlets. And just the title again is…

MM: Generative AI: Navigating the Course to the AGI Future.

DD: Awesome. And of course, for those who are also interested, you’ve got a very fast-growing, influential Generative AI newsletter, also available on your website. Can you give us a quick overview of your newsletter and what people can expect?

MM: Yes. It’s called Generative AI – Short & Sweet. Yeah. And it’s twice a week, Tuesday and Friday. On Friday, I wrap up the week with my top AI findings; I’m basically reading about AI all day. And on Tuesdays, I dive a bit deeper into specific topics. So, we have dived into how small language models, for instance, are in 90% of cases the better choice, or into Llama agents, how to apply different kinds of AI agents, and so forth.

DD: And that is also on your website, they can sign up for the newsletter there.

MM: Correct.

DD: And I can vouch, I am a subscriber to the newsletter, it’s really, really good. It’s not one of these AI generated things from somebody who was in marketing three years ago and has jumped on the bandwagon. This is the 

00:42:02

real deal. With this newsletter, Martin has been there since the inception, so it’s well worth checking out. You will get stuff that you haven’t seen elsewhere. That’s my experience.

MM: Thank you so much.

DD: So, sadly, that concludes today’s episode. Before we leave you, I just want to quickly mention our industry magazine, which you can find on our website at datasciencetalent.co.uk/media. It’s packed full of insight into what some of the world’s leading companies are doing in relation to data and AI. And again, it’s totally free. And part of this conversation will be written up into a magazine article for you to enjoy, so do check that out. Martin, thank you so much for joining us today. It’s been a pleasure listening to your insights, especially from someone who’s been in this discipline for as long as you have. Thank you for coming on the show.

MM: Thank you so much for having me. Really appreciate it.

DD: Great. And thank you also to my co-host, Philipp, for his excellent questions. And of course to you for listening. Do check out our other episodes at datascienceconversations.com, and we look forward to having you on the next show.