Galileo's Luna redefines GenAI evaluation, boasting 97% lower costs and 11x faster speeds

Galileo, a pioneer in enterprise generative AI, has unveiled Galileo Luna, a suite of Evaluation Foundation Models (EFMs) designed to change the way enterprises evaluate GenAI systems. With Luna, Galileo aims to address the issues of speed, cost, and accuracy that hinder the widespread adoption of generative AI in production environments.

“Galileo created Luna to address the limitations of current GenAI evaluation methods, which are slow, expensive, and often inaccurate,” Vikram Chatterji, co-founder and CEO of Galileo, told VentureBeat. “We were motivated by the need for very low-latency, cost-effective, and highly accurate assessments in a production environment.”

The development of Luna is a significant milestone for Galileo, which has been at the forefront of enterprise GenAI since its founding in early 2021. The company's commitment to pushing the boundaries of AI evaluation is evident in the intensive R&D process, spanning nearly a year, that led to Luna's creation.

Luna, Galileo's suite of Evaluation Foundation Models, outperforms leading AI evaluation methodologies in benchmark comparisons of area under the receiver operating characteristic curve (AUROC) scores. With AUROC values reaching 0.78, Luna shows higher accuracy in evaluating enterprise generative AI systems than competitors such as GPT-3.5, Trulens Groundedness, and RAGAS Faithfulness. (Image source: Galileo)
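For readers unfamiliar with the metric, AUROC can be read as the probability that an evaluator assigns a higher score to a randomly chosen positive example (say, a hallucinated response) than to a randomly chosen negative one. The sketch below computes it by direct pairwise comparison. The labels and scores are made-up illustration data, not Galileo's benchmark, and the `auroc` helper is a hypothetical name rather than part of any Galileo product.

```python
def auroc(labels, scores):
    """Pairwise AUROC: fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUROC needs at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical data: 1 = hallucination, 0 = grounded response.
example_labels = [1, 1, 1, 0, 0, 0]
# Hypothetical evaluator scores (higher = more likely a hallucination).
example_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]

example_auroc = auroc(example_labels, example_scores)
print(f"AUROC on the toy data: {example_auroc:.3f}")
```

A score of 0.5 corresponds to random ranking and 1.0 to a perfect ranking, which is the scale on which the reported 0.78 should be read.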

Purpose-built models that redefine speed, cost, and accuracy

At the heart of Luna's innovation are small language models carefully tailored to specific evaluation tasks, such as hallucination detection, context quality assessment, data leak prevention, and malicious prompt identification. This specialized design allows Luna to deliver strong performance across three key metrics: speed, cost, and accuracy.

“Luna outperforms GPT-3.5 in speed, cost and accuracy through several innovations,” Chatterji explained. “Luna significantly reduces computational overhead and cost by leveraging small language models purpose-built for specific evaluation tasks. These design choices allow for evaluations that are 97% cheaper and 11 times faster than those performed with GPT-3.5.”

But it's not just about speed and cost. Luna also boasts industry-leading accuracy, up to 20% better than previous methods, for detecting hallucinations, prompt injections, personally identifiable information (PII), and more. “Using advanced techniques such as multiple compact language models and intelligent chunking allows the Luna model to better maintain context and provide more accurate evaluations,” Chatterji added.

Comparing the monthly cost of evaluating 1 million queries, Galileo's Luna costs $175 per month, significantly less than other methodologies. Luna's purpose-built small language models enable low-cost evaluation, making it roughly 97% cheaper than GPT-3.5 ($6,248 per month) and cheaper still than RAGAS Faithfulness ($7,994 per month) and Trulens Groundedness ($16,641 per month). (Image source: Galileo)
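The savings percentages follow directly from the monthly figures quoted above. As a quick check, the sketch below recomputes them. It uses only the dollar amounts from the chart; the variable and function names are illustrative.

```python
# Monthly cost in USD to evaluate 1 million queries, as quoted in the chart.
monthly_cost_usd = {
    "Galileo Luna": 175,
    "GPT-3.5": 6248,
    "RAGAS Faithfulness": 7994,
    "Trulens Groundedness": 16641,
}


def percent_savings(baseline_cost, candidate_cost):
    """Percentage by which candidate_cost undercuts baseline_cost."""
    return (1 - candidate_cost / baseline_cost) * 100


luna_cost = monthly_cost_usd["Galileo Luna"]

# Savings of Luna relative to each alternative, rounded to one decimal place.
savings_vs = {
    name: round(percent_savings(cost, luna_cost), 1)
    for name, cost in monthly_cost_usd.items()
    if name != "Galileo Luna"
}

for name, pct in savings_vs.items():
    print(f"Luna vs {name}: {pct}% cheaper")

# Per-query cost for Luna at this volume (1 million queries per month).
luna_cost_per_query = luna_cost / 1_000_000
print(f"Luna cost per query: ${luna_cost_per_query:.6f}")
```

On these numbers the headline 97% figure corresponds to the GPT-3.5 comparison; the gap to the other two methods is slightly larger.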

Transform evaluation without ground truth datasets

One of the most notable aspects of Luna is that it can operate without ground truth datasets. By leveraging pre-trained evaluation models that are fine-tuned on a variety of domain-specific datasets, Luna eliminates the time-consuming and costly process of creating custom test sets. This streamlines the evaluation process and reduces reliance on extensive human-generated data.

Luna's potential applications are broad, and Chatterji highlighted its relevance in industries that require high reliability and speed in AI evaluation. “Luna is especially powerful for large enterprise applications that require volume and throughput (e.g., millions of queries per month). Fortune 100 companies in healthcare, finance and telecommunications are finding Luna particularly useful,” he said.

Galileo's Luna processes a single query with a latency of just 0.232 seconds. This is a significant improvement over other methodologies such as GPT-3.5 at 2.5 seconds, Galileo Chainpoll at 3.0 seconds, Trulens Groundedness at 3.4 seconds, and RAGAS Faithfulness at 5.4 seconds. Luna's purpose-built small language models enable low-latency evaluation, about 11x faster than GPT-3.5 and faster still than the other approaches. (Image source: Galileo)
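The speedup factors can be checked the same way, by dividing each method's latency by Luna's. The sketch below uses only the per-query latencies quoted above; the names are illustrative.

```python
# Per-query latency in seconds, as quoted in the chart.
latency_seconds = {
    "Galileo Luna": 0.232,
    "GPT-3.5": 2.5,
    "Galileo Chainpoll": 3.0,
    "Trulens Groundedness": 3.4,
    "RAGAS Faithfulness": 5.4,
}

luna_latency = latency_seconds["Galileo Luna"]

# Speedup = how many times longer the other method takes than Luna.
speedup_vs = {
    name: seconds / luna_latency
    for name, seconds in latency_seconds.items()
    if name != "Galileo Luna"
}

for name, factor in speedup_vs.items():
    print(f"Luna vs {name}: {factor:.1f}x faster")
```

The quoted 11x figure matches the GPT-3.5 ratio (2.5 / 0.232 is roughly 10.8); against RAGAS Faithfulness the same arithmetic gives a factor of about 23.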

Customization and continuous evolution following rapid GenAI advancements

Use cases range from real-time monitoring of AI output and hallucination detection in AI-generated content to ensuring the safety and quality of chatbot interactions. Additionally, Galileo's Fine Tune product allows Luna to be tailored to specific customer requirements, achieving accuracy levels of 95% or better for critical tasks in industries such as pharmaceuticals and financial services.

As the generative AI landscape continues to rapidly evolve, Galileo is committed to staying at the forefront of innovation. Chatterji emphasized that Luna will expand in three key ways: expanding support for more types of assessment tasks, continuing to improve accuracy, and further reducing costs and latency.

“Galileo is committed to expanding the boundaries of what is possible in AI evaluation and helping organizations bring trustworthy AI into production,” said Chatterji. “As the generative AI landscape continues to evolve, Galileo is committed to providing our customers with cutting-edge assessment capabilities that help businesses deploy AI pragmatically and inspire confidence and trust among consumers.”

With the launch of Luna, Galileo solidifies its position as a leader in enterprise GenAI evaluation. As more organizations seek to harness the power of generative AI, Luna's ability to provide fast, cost-effective, and accurate assessments will be a critical factor in driving widespread adoption of this innovative technology and unlocking its full potential.
