Accuracy and Consumption analysis from a compressed model by CompactifAI from Multiverse Computing

Abstract

This study evaluates the performance of a compression method, called CompactifAI, developed by Multiverse Computing, applied to the large language model Llama 3.1 8Bllama. The evaluation focused on model efficiency (in terms of energy consumption) and accuracy using respectively the frameworks Codecarboncodecarbon and Ragasragas. A comparison was performed between the model compressed with CompactifAIcompactifaicompactifai2 and its full-size version. Our findings reveal that the compressed model using CompactifAI not only significantly reduced the computational resources but also maintained the model accuracy, making the model more efficient, scalable and cost-effective.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…