Nab theme, more professional navigation theme
Ctrl + D Favorites
Current Position:fig. beginning " AI knowledge

AI Engineering Institute: 3Fine-tuning (fine-tuning of large language models)

2024-11-19 805

📚 Structure of the database

Models/Catalog Description and content
Axolotl A framework for fine-tuning language models
Gemma Google's latest implementation of the Big Language Model
finetune-gemma.ipynb - gemma-sft.py - Gemma_finetuning_notebook.ipynb Fine-tuning notebooks and scripts
LLama2 Meta's Open Source Large Language Model
generate_response_stream.py - Llama2_finetuning_notebook.ipynb - Llama_2_Fine_Tuning_using_QLora.ipynb Implementation and fine-tuning guidelines
Llama3 Upcoming Meta Large Language Modeling Experiments
Llama3_finetuning_notebook.ipynb Initial fine-tuning experiments
LlamaFactory A Framework for Training and Deployment of Large Language Models
LLMArchitecture/ParameterCount Technical details of the model architecture
Mistral-7b Mistral AI The 7 billion parameter model
LLM_evaluation_harness_for_Arc_Easy_and_SST.ipynb - Mistral_Colab_Finetune_ipynb_Colab_Final.ipynb - notebooks_chatml_inference.ipynb - notebooks_DPO_fine_tuning.ipynb - notebooks_SFTTrainer TRL.ipynb - SFT.py Integrated notebook for assessment, fine-tuning and reasoning
Mixtral Mixtral's Expert Mixing Model
Mixtral_fine_tuning.ipynb Fine-tuning Realization
VLM visual language model
Florence2_finetuning_notebook.ipynb - PaliGemma_finetuning_notebook.ipynb Visual language model implementation

🎯 Module Overview

1. LLM architecture

  • Explore the following model implementations:
    • Llama2 (Meta's open source model)
    • Mistral-7b (efficient 7 billion parameter model)
    • Mixtral (expert hybrid architecture)
    • Gemma (Google's latest contribution)
    • Llama3 (upcoming experiment)

2. 🛠️ fine-tuning technology

  • implementation strategy
  • The LoRA (Low Rank Adaptation) approach
  • Advanced Optimization Methods

3. 🏗️ model architecture analysis

  • An in-depth study of the model structure
  • Parameter calculation method
  • Scalability Considerations

4. 🔧 Professional realization

  • Code Llama for programming tasks
  • Visual language modeling:
    • Florence2
    • PaliGemma

5. 💻 Practical applications

  • Integrated Jupyter Notebook
  • Response Generation Pipeline
  • Reasoning Implementation Guide

6. 🚀 Advanced Themes

  • DPO (Direct Preference Optimization)
  • SFT (supervised fine tuning)
  • Assessment methodology

Recommended

Can't find AI tools? Try here!

Just type in the keyword Accessibility Bing SearchYou can quickly find all the AI tools on this site.

Scan the code to follow

qrcode

Contact Us

Top

en_USEnglish