Blockchain

Llama-3 Fine-Tuning Achieves 90% of GPT-4’s Performance at Lower Cost

July 14, 2024

Luisa Crawford
Jul 14, 2024 02:46

Llama-3 fine-tuning demonstrates significant performance gains, achieving 90% of GPT-4’s accuracy at a fraction of the cost, according to together.ai.

The success of Llama-3 has been remarkable, showcasing that open-source models are closing the gap with their closed-source counterparts, according to together.ai. By leveraging proprietary data, customers have been able to fine-tune smaller open-source software (OSS) models like Llama-3 to achieve higher accuracy than top-tier closed-source models.

Fine-Tuning Process

Together AI’s platform allows users to fine-tune Llama-3-8B on proprietary data, creating custom models that outperform larger OSS alternatives like Llama-3-70B and are comparable to leading closed-source models like GPT-4, all at a fraction of the cost. A detailed guide demonstrates how a fine-tuned Llama-3 8B model improved from 47% accuracy to 65%, surpassing Llama-3-70B’s 64% and nearing GPT-4’s 71% accuracy.

The fine-tuning process involves several steps, including dataset transformation, uploading and verifying datasets, starting a fine-tuning job, and running evaluations to compare the results. The initial step requires downloading the Math Instruct dataset from HuggingFace, cleaning it up, and transforming it into a JSONL file format suitable for Together’s platform.

Dataset Transformation

The transformation process involves loading the original JSON data, defining the Llama-3 prompt format, and converting the data into the correct format. This formatted dataset is then validated using Together’s SDK before being uploaded for fine-tuning.

Uploading and Fine-Tuning

Once the dataset is prepared, it is uploaded to Together AI via the Python SDK. The fine-tuning job is then created using the Llama-3-8B base model, specifying the dataset, number of epochs, and other parameters. Users can monitor the fine-tuning job through Together AI’s dashboard.

Evaluation and Results

After fine-tuning, the model’s performance is evaluated using 1000 math problems. The fine-tuned Llama-3-8B model’s accuracy is compared to the base Llama-3-8B, Llama-3-70B, and GPT-4. The fine-tuned model achieved a 65.2% accuracy, outperforming the base model’s 47.2% and Llama-3-70B’s 64.2%, and coming close to GPT-4’s 71.4% accuracy.

The results indicate that the fine-tuned Llama-3-8B model outperformed the base model by nearly 20%, surpassed the top OSS model Llama-3-70B, and achieved over 90% of GPT-4’s accuracy. Additionally, the fine-tuned model is faster, 50 times cheaper than GPT-4, and offers full ownership of the model and weights.

Conclusion

This fine-tuning approach demonstrates that small open-source models like Llama-3-8B can be customized to perform specific tasks with high accuracy, speed, and cost-efficiency. Users can leverage their proprietary data to fine-tune a model and either host it on Together AI or run it independently, maintaining full control and ownership.

The Llama-3-8B model trained on math problems outperformed leading OSS models and approached GPT-4’s performance, with a total fine-tuning cost of less than $100 on Together AI.

Image source: Shutterstock

Credit: Source link

Llama-3 Fine-Tuning Achieves 90% of GPT-4’s Performance at Lower Cost

Fine-Tuning Process

Dataset Transformation

Uploading and Fine-Tuning

Evaluation and Results

Conclusion

LEAVE A REPLY Cancel reply

MOST POPULAR

Did Sam Bankman Fried Try To Pay Off Donald Trump To...

Coinbase demands SEC respond to rulemaking petition

Dogwifhat (WIF) Poised For Big Growth — Analyst

VeChain (VET) explains why its PoA 2.0 upgrade is a big...

HOT NEWS

Massive Spike in Important Shiba Inu (SHIB) Metric: What Does it...

Christmas Rally Countdown: Don’t Miss These Cryptos Ready to Skyrocket!

XRP/USD gave away half of the 2023 gains. A bearish channel...

A Crucial Market Test Looms for Bitcoin Cash (BCH) and Polygon...

EDITOR PICKS

Dogecoin Whales Go Ham As They Buy 560M DOGE In One...

Stablecoins Quietly Balloon by $14B in January — Who’s Leading the...

BONK Early Investor Who Also Predicted Shiba Inu Has Just Purchased...

POPULAR POSTS

The Best Cloud Mining Site for Passive Income in 2023

Kadena vs. Solana: Ultimate Comparison

How To Stake Polygon (MATIC) Using Ledger and MetaMask

POPULAR CATEGORY

Avalanche Launches Avaissance to Boost NFT Ecosystem

Stacks (STX) Price Prediction: Potential Turbulence On The Horizon And The...