To LLM or Not to LLM: Tips for Responsible Innovation

To LLM or Not to LLM: Tips for Responsible Innovation

Danielle Maxwell, Prema Roman | Wednesday, May 8, 2024 |  LLMs AI MLPythonData

We’re seeing a proliferation of Large Language Models (LLMs) as companies seek to replicate OpenAI’s success. In this post, two AI engineers respond to LLM FAQs and offer tips for responsible innovation.

In this post, Danielle Maxwell and Prema Roman, two AI engineers at Rotational Labs, respond to common LLM-related questions we get from prospective clients.

Question: What are some costs of building LLMs that get overlooked?

Here’s the thing: there is a lot that went into the development of ChatGPT. Microsoft invested a billion dollars in OpenAI that enabled it to set up the massive infrastructure needed to train ChatGPT on terabytes of data.

On top of that consider the energy consumption incurred as the model was being trained. Timnit Gebru et al. in their research paper referenced a benchmark study conducted by Strubell et al. where they made the following observation:

Training a single BERT base model (without hyper parameter tuning) on GPUs was estimated to require as much energy as a trans-American flight.

They also note that the size of the data that is used to train these models makes it all the more difficult to effectively understand and document the data and its characteristics.

This only scratches the surface of all the complexities and risks organizations must face as they consider using LLMs within their organization. All of this work was done to only develop the model. The work doesn’t end there. This model has to be integrated into a software application that will serve a business need. This means that there will be additional costs to set up the infrastructure for the software application. But that’s not all - because this is a machine learning application instead of a traditional software application, work needs to be done to maintain both the model AND the software application.

As you can see, replicating OpenAI’s success is a costly affair. And it’s not necessarily going to translate into responsible innovation in most cases.

Question: Are LLMs even accurate? How do you know if the LLM is giving the right responses?

All models suffer from “drift” over time, that is, their ability to make accurate inferences decreases. This means that models need to be retrained on a periodic basis. Companies also have to account for inaccurate inferences, which means that they will need a way to collect that information and use it to refine the model.

There are also cases where implementing LLMs led to catastrophic results. This is because LLMs are prone to hallucinations, which are nonsensical or misleading responses to user queries. For example, Air Canada was forced to pay one of its customers after its chatbot mistakenly stated that the customer can apply for bereavement rates and will be refunded retroactively. OpenAI also suffered from an outage that stemmed from a bug that was introduced when it upgraded its application. The reality is that such bugs are possible due to the added complexity of LLMs.

There are strategies that have been introduced to mitigate hallucinations such as prompt engineering and retrieval augmented generation (RAG). But these strategies only add more complexity to an already complex system.

Question: Do we really need our own LLM?

While LLMs have been instrumental and have unlocked a lot of use cases that were previously not possible, we have to consider the trade-offs. It may appear that if you are not using LLMs, you are missing out. But that is far from the truth.

The best approach to achieve ROI on your investments is to first take a step back and ask yourself what is the end goal you wish to achieve. After that, have your team work on brainstorming ideas on how to achieve this goal. LLMs are only one set of tools in a large collection of tools that businesses can leverage.

It’s really important to determine what type of machine learning approach makes sense for a given problem before diving into costly compute and hardware spending. This is actually one of the key tenets of Responsible Innovation, an approach we use at Rotational Labs.

We propose answering the following questions at the start of a project:

  • Can the problem be solved by using business rules?
  • Can the problem be solved by using advanced analytics?
  • Can the problem be solved by using traditional machine learning?

What we have observed in our experience is that it is best to start simple and work your way up. Believe it or not, in spite of all the stories around LLMs being this catch-all solution to solve all business problems, there are many problems that businesses currently face that can be solved by traditional machine learning.

Consider a use case where your business has a content management system that is highly disorganized and not tagged and classified. An employee who is already pressed for time now has to go through this disorganized system to find the one document that they need. And if it’s a new employee, they may not even know what words and acronyms to search for yet!

Now multiply that with the number of employees and the number of times they have to do this on a regular basis. You can see how much productivity is lost by this activity, not to mention the frustration your employees feel when doing this task. This is a task that several existing classification models are good at. None of them need to be trained on the amount of data that ChatGPT was trained on nor do they require expensive hardware.

Stay tuned!

In the next post, read more about Responsible Innovation through the lens of a recent project Danielle and Prema worked on (automatically detecting “hype language” in marketing copy); as you’ll see, it really pays to start out with simple models to see how far you can get!

Photo by Elena Mozhvilo on Unsplash

About This Post

When it comes to AI projects, be cautious about the hype around cutting-edge technologies that make promises they can't deliver

Share this post:

Recommended  Rotations

butterfly
View all

How to build a text-to-sql LLM application

As industry races for use cases of Large Language Models, software devs have emerged as early adopters. Can LLMs help us translate between tech and talk? Let’s build a text-to-SQL application with Vanna and Streamlit!

Jun 7, 2024

To LLM or Not to LLM (Part 2): Starting Simple

Sick of hearing about hyped up AI solutions that sound like hot air? 🧐 Let’s use boring old ML to detect hype in AI marketing text and see why starting with a simple ML approach is still your best bet 90% of the time.

Monitoring Real-Time Machine Learning Applications With Prefect

The last few years have marked a shift in industry away from research-oriented machine learning. These days, shipping models early and often is critical, and open source tools like Prefect can speed up the process of operationalizing ML.

Jan 26, 2024
Enter Your Email To Subscribe