Hugging Face and watsonx: why open source is the future of AI in business
Hugging Face and watsonx: why open source is the future of AI in business Malcolm Gladwell: Hello, hello. Welcome to Smart Talks with IBM, a podcast from Pushkin Industries, iHeartRadio and IBM. I’m Malcolm Gladwell. This season, we’re continuing our conversation with New Creators – visionaries who are creatively applying technology in business to drive change — but with a focus on the transformative power of artificial intelligence and what it means to leverage AI as a game- changing multiplier for YOUR business. Our guest today is Jeff Boudier [BOO-dee-ay], Head of Product and Growth at Hugging Face, the leading open-source and open- science artificial-intelligence platform. Before getting into the world of open-source AI, Jeff co-founded a company called Stupeflix, a video-editing software which was eventually acquired by GoPro. An engineer
by background, he has a self-professed obsession with the business of technology. Recently, IBM and Hugging Face announced a collaboration, bringing together Hugging Face’s repositories of open-source AI models with IBM’s watsonx platform. It’s a move that gives businesses even more access to AI while staying true to IBM’s long-standing philosophy of supporting open-source technology. With open source, businesses can build better AI models that suit their specific needs, using their own proprietary data, while browsing a ready catalog of pretrained models. In today’s episode, you’ll hear why open source is so crucial to the advancement of AI, how IBM’s watsonx interacts with open-source AI, and Jeff’s thoughts on why the singular, omnipotent AI model is a myth. Jeff spoke with Tim Harford,
host of the Pushkin podcast Cautionary Tales. A longtime columnist at the Financial Times, where he writes the “Undercover Economist,” Tim is also a BBC broadcaster with his show More or Less. OK, let’s get to the interview. Jeff Boudier: Hi, I'm Jeff Boudier,
and I'm a Product Director at Hugging Face. Tim Harford: So I'm immediately intrigued. Hugging Face—is this a reference to the Alien movie, or something else? Jeff Boudier: It is not. And it may be not obvious to a listener, but “Hugging Face” is the name of that cute emoji—you know, the one that's smiling with his two hands extended like that to give you a big hug? That's Hugging Face. So basically we named the company after an emoji. Tim Harford: Okay. And it is—I saw your website, and it is a very friendly emoji. So that's nice. So tell us a little bit about Hugging
Face and, and about what you do there. Jeff Boudier: Of course. Well, Hugging Face is the leading open platform for AI builders, and it's the place that all of the AI researchers use to share their work, their new AI models, and collaborate around them. It's the place where the data scientists go and find those pretrained models and access them and use them and work with them. And increasingly, it's the place where
developers are coming to turn all of these AI models and datasets into their own applications, their own features. Tim Harford: So like the Facebook group or the Reddit or the Twitter for people who are interested in, particularly, generative-language AI, or all kinds of artificial intelligence? Jeff Boudier: All kinds of AI, really. And of course generative AI is this new wave that has caught the world by storm. But
on Hugging Face you can find any kind of model. The new, sort of, transformers models,to do anything from, translation—or if you wanted to transcribe what I'm saying into text, uh, then you would use a transformer model. If you wanted to, then take that text and make a summary, that would be another transformer model. If you wanted to create a nice little thumbnail for the, this podcast by typing a sentence, that would be, uh, another type of model. So all of these models you can find—there's actually,
300,000 that are free and publicly accessible—you can find them on our website at huggingface.co, and use them, using our open-source libraries. Tim Harford: And so this is—this is fascinating. So, so there are 300,000 models. Now, when you say “model,” I'm thinking in my head, “Oh, it's kind of like, um, like a computer program.” There were 300,000 computer programs. Is that—is that roughly right? Or it—not really? Jeff Boudier: It's a general idea. A model is a
giant, set of numbers that are working together to sift through some input that you're going to give it. So think of it as a big black box filled with numbers. And you give it as a, as an input maybe some text, maybe a prompt. So you're asking—you're giving an instruction to the model, or maybe you give it an image as an input, and then it will sift through that information, thanks to all of these numbers, which we call, in the field, “parameters,” and it will produce an output. So when I told you, “Hey, we can transcribe this conversation into text,” the input would have been the conversation in an audio file, and then the output would have been the text of the transcription. If you want to create a thumbnail for this podcast episode, then the input would be what we call the “prompt,” which is really a text description— like, uh, “French man in San Francisco talking about machine learning.” And the output would be a completely original
image. So that's how I think about what an AI model is. And I think what we're starting to realize is that this is becoming the new way of building technology in the world. It has been—for the field of dealing—understanding, generating text—for quite some time. But now it's sort of moving across every field of technology. We have models to create images, as I say, but also to generate new proteins, to make predictions on numerical data. So every kind of field of machine learning is now using this new type of models.
But what's interesting is that if you're,say, a product manager at a tech company, and you say, “Hey, I want to build a feature that does this,” a few years ago the approach would have been to ask a software developer to write a thousand lines of code in order to build a prototype. And a new way of doing things today is to go look for an off-the-shelf, pretrained model that does a pretty good job of solving exactly that problem. So you can create a prototype of that feature fast. So it's a new approach to building tech. Tim Harford: I'm not a programmer, but I'm aware
that there's this idea of open-source code, and now we have open-source models. So what does it mean for something to be “open source”? Jeff Boudier: Yes, “open-source AI” actually means a lot of different, specific things. It's the open-source implementation of the model. So if you use the Hugging Face transformers library to use a model, you're using an open-source code library to use that model. Tim Harford: Just to interrupt on the “transformers”—these are these, kind of, ways of turning a picture of a dog into a text output that says, “Hey, this is a picture of a dog,” or “This is a French text,” and—with the transformers helping you turn it into English text. Or it's doing all
of these things that you've been describing. That's—the, the transformer is the—kind of the engine at the, at the heart of that. Jeff Boudier: Yes, exactly. And we call them “transformers” because they correspond to this new way of building machine-learning models that was introduced by Google, actually, with a very important paper called “Attention Is All You Need.” And it was published in 2017 by researchers out of Google DeepMind. Tim Harford: Wow,
that's just six years. That's so new. Jeff Boudier: It is very new, and ever since, the pace of innovation has really, really accelerated. But it really started from this inflection point that came from this paper and its implementation in what is now called “transformer models.” The transformer—that has conquered every area of machine learning since. Tim Harford: Okay. Sorry to interrupt. So, so—you've got this library of transformer models, and they're open source, and that means, that means what? Anyone can use them for free? Or that, or that anybody can implement them for free? What does it mean? Jeff Boudier: So again, there's, like, lots that go into it, but the most important thing is for the model itself to be available so that a data scientist or an engineer can download them and use them. And also there are a lot of considerations about how you make them accessible. And a very important one is whether or not you give access to the training data—all the
information that went into training that model and teaching it to do what, what it's trained to do. That's, uh— Tim Harford: Have fed millions of words into a, into a language transformer, or I might have fed millions of photographs into a, into a picture transformer. Yeah. Jeff Boudier: Yes. And the accessibility of that training data is very, very important. TIM HARFORD: What's the relationship between the,
the Hugging Face libraries and, uh, GitHub? Which, if I understand GitHub correctly, it's this—the repository of open-source code, lots and lots of lines of code and routines and programs that are, that are shared and updated and tracked, and they're all available on, on GitHub. Which sounds similar to what you're doing with Hugging Face for AI. So what, what is the interaction or the relationship there? JEFF BOUDIER Yeah, I think you nailed it on the head. So Hugging Face is to AI what GitHub is to code, right? It's this central platform where AI builders can go find, and collaborate around, AI artifacts, which are models and datasets. So it's
quite—it's different than software, but we play the central, the central role in the community to share and collaborate and access all of those artifacts for AI like GitHub offers for code. Tim Harford: And that community must be incredibly important. I mean, the—open source is nothing if you don't have a community of people working on it. So how have you been able to foster— Jeff Boudier: Well, I think it goes to the origins of the transformer model, and, and Hugging Face rolled into that. So when the first, sort of, open model came out, it was called BERT,
and it came out of Google. The only way you could access it was to use a tool called TensorFlow. But it happened that most of the AI community was using a different tool, called PyTorch. And something that Hugging Face did is to make that new model, BERT, accessible to all PyTorch users. And they did it in open source. It was a project called “BERT's Pretrained PyTorch” or “BERT PyTorch Pretrained.” Tim Harford: So this is like being able to play my Zelda game on, on an Xbox or a PlayStation, right? Or am I not really understanding what's going on? Jeff Boudier: No, that's exactly what it is. And the thing is, everybody was using the Game Boy, and so it became very popular. And from there, the community sort of gathered to make all of
the other models that were then published by AI researchers available through that library, which was quickly renamed from “BERT’s Pretrained PyTorch” into “transformers” to welcome, like, all of these different, new models. And today, that's—open-source library transformers is what all AI builders are using when they want to access those models, see how they work and build upon them. Tim Harford: What's striking about this field is that it's changing so fast. It's improving so quickly. So how do open-source models keep up with that? How do they get iterated and improved? Jeff Boudier: Actually, it's not so much that open source is keeping up with it; it's actually open source that is driving that is driving this pace of change. And that's because— with open source and open research, data scientists’, researchers can build upon each other's work. They can reproduce each other's work. They can
access each other's work using our open-source libraries, et cetera. So in a sense, it's not really that open-source AI is a new idea. It's rather the opposite. There's been a blip of time in which closed-source AI seems to be the dominant way, but it's really a blip. In fact, you know, none of the incredible advances that we're marveling about today would be possible without open source. We're standing upon the shoulders of 50 years
of research in open-source software. So I think that that's really important; if it wasn't for that, we'd probably be 50 years away from having this—amazing experiences like ChatGPT or Stable Diffusion, et cetera. So it's really open source that is fueling this pace of change—all, all these new models, all these new capabilities. To give you an example: so Meta released, uh, LLaMA large language model just a few months ago. And ever since, there's been this Cambrian explosion of variations and improvements upon the original models. And today there are over a thousand of them that we host and track and evaluate. So, yeah, open source is really the gas and the engine for that.
Malcolm Gladwell: Jeff just made it clear that it is open source, not closed, that sets the pace for AI innovation. If that’s true, then forward-thinking businesses shouldn't shy from leveraging open-source AI to solve their own proprietary challenges. But how? Businesses can face serious obstacles when trying to adopt open-source technologies, like complying with government regulation or making sure their customers’ data stays protected.
In the next part of their conversation, Jeff and Tim discuss how IBM’s collaboration with Hugging Face empowers businesses to tap into the open-source AI community, and how the watsonx platform can enable them to customize those AI models to their needs. Tim Harford: I just wanted to ask about the partnership between Hugging Face and IBM. How did that come about? Jeff Boudier: Well, it came through a conversation. A conversation between our CEO, Clément Delangue, and Bill Higgins, IBM, who's
really, really close to all the amazing research work and open-source work that's happening at IBM. And that conversation sort of sparked the evidence that we needed to do something together. We share a lot of values in terms of the importance of open source, which is fundamental to us; with the importance of doing things in a, an ethics-first way, to enable the community to incorporate ethical considerations in how they're building AI. And we sort of have a different audience to start with, which is all the AI builders use Hugging Face today. To access all the models we talked about; to use them using our open source, and build with them. And IBM has this incredible history of working with enterprise companies and enabling them to make use of that technology in a way that's compliant with everything that an enterprise requires. And so being able to marry these two things together is an amazing opportunity. And now
we can enable the largest corporations that have, sort of, complex requirements in order to deploy machine-learning systems and give them an easy experience—to take advantage of all the latest and greatest that AI has to offer through our platform. Tim Harford: Let's talk about this idea of a, of a single model or a variety of models. Because what I've been hearing you say—you've been saying, “Oh, there are lots of models, there are hundreds of thousands of models available on Hugging Face.” But you've also said “There's this single thing, the transformer, and they're all transformers.” So if they're all basically the same thing, why can't you just build one super-clever model that can do everything? Jeff Boudier: That's a really interesting idea, and very much a new idea. And the reason we have
over a million repositories, 300,000 free and accessible models, on the Hugging Face platform is that models are typically trained to do one thing—and they're typically trained to do one thing with specific types of data. And what became new and evident in the research that came out over the last couple years is that if you train a big enough model with enough data, then those models start to have a—sort of general capabilities. You can ask them to do different things. You can even train them to respond to instructions.
So with the same model you can say, “Hey, summarize this paragraph, translate this into English, start a conversation in French, and then pivot to German.” And so these are general, sort of, language capabilities. And I think when ChatGPT came online, and the world, sort of, discovered these new capabilities, there was, at least for a short period, this sort of idea, this sort of myth, that “the end game of all this is maybe one or a handful of models are so much better than anything else that exists, that they can do anything that we can ask them to do. And that's the only model that we will need.” And I, for one—I think it is a myth. I don't think it is practical, for a variety of reasons. Say you're writing an email and you have, like, this great suggestion of text to, sort of, complete your sentence. Well, that's AI. That's, that's a large language model. That's a transformer model that does that.
So there are a ton of existing use cases like this, and these use cases are powered by specific models that have been trained to do one thing well and to do it fast. If you wanted to apply these, sort of—all-knowing, powerful, Oracle type of model, you would not be able to serve millions of customers through a search engine. You would not be able to complete people's sentences, because the amount of money that you would need, the number of computers that you would need, to run such a service—it just exceeds what is available on the planet. So one reason for which it's not a practical scenario is that it's just very expensive to run those very, very large models. Tim Harford: What I'm hearing is, it's like, “Look, if you want to screw in a screw, you need a screwdriver.” You don't
want an entire toolshed full of tools if the task is to screw in—a screwdriver. And sure, you could bring the toolshed. There are all the tools; there's a screwdriver there. But it's not necessary. It's incredibly expensive. It's incredibly cumbersome, and that cost exists even though, maybe, as the user who's just typing in a, into a prompt box, the user may not see it, but it's still very real. Jeff Boudier: That's right. And then another one is performance. So, taking the screwdriver example, so—and by the way,
like, we're not quite there at this moment where we have this all-knowing, powerful Oracle; that is still sort of a sci-fi scenario. But—we have screwdrivers, but we also have the Leatherman, right? The, the multitool—a Swiss Army knife, and that's, sort of the moment that we are in today. But now if I'm trying to open up my computer, it turns out that it requires a specific kind of a screw, like these tiny little Torx screws, and having a Torx screwdriver will get me much further than trying to use my Leatherman, where maybe I'll get the knife blade and it'll, it will mess up the screw, and maybe eventually I'll get to what I need—but my point is that if you take a very specifically trained model for a particular problem, it will work much better. It will give you better results than a very, very generalistic, big model that can do a lot of things. And so for things like search engines, for things like translation, for things that are very specific, companies are much better off using smaller, more-efficient models that produce better results. Tim Harford: That's really interesting and
presumably then being able to know which model to use, or being able to know who to ask which model to use, becomes a very important capability. Jeff Boudier: Yes, and that's what we're trying to make easy through our platform. Tim Harford: So tell me about how this works with IBM's watsonx platform. How do you see Hugging Faces’ customers benefiting from that? Jeff Boudier: The end goal is to make it really easy for watsonx customers to make use of all the great models and libraries that we talked about—all the, the, the 300,000 models that are today on Hugging Face. And to do this, we need to really collaborate deeply with the IBM teams that build the watsonx platform, so that our libraries, our open source, our models are well integrated into the platform. If you're a single user, if you are a data-science student, and you want to
use a model, we make it super easy, right? We have our open-source library. You can download the model on your computer and run with it then. But in enterprises, there is a vast complexity of infrastructure and rules around what people can do and how the data can be accessed. And all this complexity is, sort of, solved by the watsonx platform. Tim Harford: This season of the Smart Talks podcast features what we're calling New Creators. Do you, do you see yourself as being a creative person? Jeff Boudier: I think it's a requirement for the job. I mean, we're in such a new and rapidly
evolving industry that we have to be creative in order to invent the business models, the use cases, of tomorrow. My role within the company is really to create the business around all of the great work of our science and open-source and product team. And by and large the business model of AI within the whole ecosystem is still something that companies are trying to figure out.
So creativity is really, really important—to really have the conversation with companies, understand what they're trying to do and then build the right kind of solution. So that's, like, where creativity comes into play. Tim Harford: And one of the things that you've, you've been talking about is just this, this growing number of models, this growing number of capabilities, this growing number of use cases—enormously exciting. But also, I think, completely bewildering for most people who are trying to navigate their way through this, this maze of possibilities that is, that is growing faster than they can even learn about it. So how are you helping people navigate and make choices in that environment? And how does the partnership with IBM help with that? Jeff Boudier: Hmm. Well, as I said, like, our vision is that AI machine learning is becoming the default way of creating technology. And that means, like, every product, app,
service that you're going to be using is going to be using AI to do whatever it is better, faster. And I guess there are two competing visions of the world coming from that. There is this vision of the Oracle all-powerful model that can do everything. And our vision is different. Our vision is that every single company will be able to create their own models that they own, uh, that they can use, that they control. And that's the, the, the vision that we're trying
to bring to life through our open-source tools that make this work easy, through our platform where you can find all those pretrained models that are shared by the community. So we really want to empower companies to build their own stuff, not to outsource all the intelligence to a third party, and the watsonx platform from IBM gives those tools to enterprise companies. So that's—you can use the open-source models that Hugging Face offers. Then you can improve them with your own data without sharing that data to a third party.
And then you could do every—all of this work in compliance with whatever governance requirements you have for your company. Maybe you’re a finance services company and you have a specific set of rules. Maybe you’re a healthcare company and you have very strong privacy requirements for patients’ data. Maybe you’re a tech company and you have your, your customers’, your users’ personal information. So you need to be able to do this work, respecting all of that.
Tim Harford: Jeff Boudier. Thank you very much. Jeff Boudier: Thanks so much, Tim. It was fun. Malcolm Gladwell: To create the AI models of the future, we are going to need open source. That means there’s a place for business in the open- source community to harness the game-changing potential of AI innovation. Like Jeff said, businesses face unique challenges they need to solve at scale. Without proper support systems, tapping into open- source AI at enterprise level is daunting: finding the right-sized model for the job, fine-tuning its purpose, all while addressing governance requirements around data, privacy, and ethics.
So for businesses, IBM’s collaboration with Hugging Face is a mark of progress because it signifies that business can tap into open- source AI while preserving enterprise-level integrity. Businesses should embrace the open-source community and the AI future, much like Hugging Face (and its emoji namesake) suggests. I’m Malcolm Gladwell. This is a paid advertisement from IBM.
Smart Talks with IBM is produced by Matt Romano, David Zha [JAH], Nisha [Nih-sha] Venkat, and Royston Beserve, with Jacob Goldstein. We’re edited by Lidia Jean Kott. Our engineers are Jason Gambrell, Sarah Bruguiere [Brew-Ghare (hard G!)], and Ben Tolliday. Theme song by Gramoscope. Special thanks to Carly Migliori, Andy Kelly, Kathy Callaghan [Calla-Han], and the EightBar and IBM teams, as well as the Pushkin marketing team. Smart Talks with IBM is a production of Pushkin Industries and Ruby Studio at iHeartMedia. To find more Pushkin podcasts, listen on the iHeartRadio app, Apple Podcasts, or wherever you listen to podcasts.
2024-05-03 12:23