Blockchain

Llama-3 Fine-Tuning Achieves 90% of GPT-4’s Performance at Lower Cost

July 14, 2024

Luisa Crawford
Jul 14, 2024 02:46

Llama-3 fine-tuning demonstrates significant performance gains, achieving 90% of GPT-4’s accuracy at a fraction of the cost, according to together.ai.

The success of Llama-3 has been remarkable, showcasing that open-source models are closing the gap with their closed-source counterparts, according to together.ai. By leveraging proprietary data, customers have been able to fine-tune smaller open-source software (OSS) models like Llama-3 to achieve higher accuracy than top-tier closed-source models.

Fine-Tuning Process

Together AI’s platform allows users to fine-tune Llama-3-8B on proprietary data, creating custom models that outperform larger OSS alternatives like Llama-3-70B and are comparable to leading closed-source models like GPT-4, all at a fraction of the cost. A detailed guide demonstrates how a fine-tuned Llama-3 8B model improved from 47% accuracy to 65%, surpassing Llama-3-70B’s 64% and nearing GPT-4’s 71% accuracy.

The fine-tuning process involves several steps, including dataset transformation, uploading and verifying datasets, starting a fine-tuning job, and running evaluations to compare the results. The initial step requires downloading the Math Instruct dataset from HuggingFace, cleaning it up, and transforming it into a JSONL file format suitable for Together’s platform.

Dataset Transformation

The transformation process involves loading the original JSON data, defining the Llama-3 prompt format, and converting the data into the correct format. This formatted dataset is then validated using Together’s SDK before being uploaded for fine-tuning.

Uploading and Fine-Tuning

Once the dataset is prepared, it is uploaded to Together AI via the Python SDK. The fine-tuning job is then created using the Llama-3-8B base model, specifying the dataset, number of epochs, and other parameters. Users can monitor the fine-tuning job through Together AI’s dashboard.

Evaluation and Results

After fine-tuning, the model’s performance is evaluated using 1000 math problems. The fine-tuned Llama-3-8B model’s accuracy is compared to the base Llama-3-8B, Llama-3-70B, and GPT-4. The fine-tuned model achieved a 65.2% accuracy, outperforming the base model’s 47.2% and Llama-3-70B’s 64.2%, and coming close to GPT-4’s 71.4% accuracy.

The results indicate that the fine-tuned Llama-3-8B model outperformed the base model by nearly 20%, surpassed the top OSS model Llama-3-70B, and achieved over 90% of GPT-4’s accuracy. Additionally, the fine-tuned model is faster, 50 times cheaper than GPT-4, and offers full ownership of the model and weights.

Conclusion

This fine-tuning approach demonstrates that small open-source models like Llama-3-8B can be customized to perform specific tasks with high accuracy, speed, and cost-efficiency. Users can leverage their proprietary data to fine-tune a model and either host it on Together AI or run it independently, maintaining full control and ownership.

The Llama-3-8B model trained on math problems outperformed leading OSS models and approached GPT-4’s performance, with a total fine-tuning cost of less than $100 on Together AI.

Image source: Shutterstock

Credit: Source link

Llama-3 Fine-Tuning Achieves 90% of GPT-4’s Performance at Lower Cost

Fine-Tuning Process

Dataset Transformation

Uploading and Fine-Tuning

Evaluation and Results

Conclusion

LEAVE A REPLY Cancel reply

MOST POPULAR

Crypto market regains 11% capitalization in a week

Top Exchanges Record Influx of 6.84 Trillion SHIB in 24 Hours

Bitcoin Dogs Set To Make History With First Ever ICO on...

WATER memecoin gains recede after Ronaldinho’s promo

HOT NEWS

There Are Hidden Giants Investing In The Bitcoin ETFs: Bitwise

Why Bitcoin (BTC) Is Up 85% This Year: Q2 2023 Review

Bitget Partners with Comedian Adam Devine for SetForChange Campaign to Promote...

Will DOT Surge 35% by May End?

EDITOR PICKS

Solana ETF Momentum Grows Amid Reports of SEC Engagement

NEAR, ATOM, DOT: Wyckoff Patterns with Explosive Bull Market Long-Term Potential

NVIDIA’s TensorRT-LLM Multiblock Attention Enhances AI Inference on HGX H200

POPULAR POSTS

The Best Cloud Mining Site for Passive Income in 2023

Kadena vs. Solana: Ultimate Comparison

How To Stake Polygon (MATIC) Using Ledger and MetaMask

POPULAR CATEGORY

Apple’s New Guidelines Permit NFTs But There’s a Caveat

Top Trending Crypto Coins on DEXTools – World Record Banana, ELF...