GitHub - Vjay15/Finetuning_with_scraps: Study Repo for PEFT LoRA

Fine Tuning and Inferencing

This repo contains all the python notebooks that were created in order to learn about Fine Tuning, LoRA and vLLM inferencing when you don't have any compute and Google Colab is your only friend, except for the notebook where I demonstrate OpenAI Fine Tuning for which the company I interned at gave access to me (Thank you Greatify!)

This repo explores LoRA fine-tuning using freely available tools and libraries, excluding OpenAI. All notebooks are designed for easy execution in Google Colab(T4 instance), and Python scripts require dependency installation.

The Datasets were synthetically generated using Claude and GPT by feeding them a Science and Social Paper

All of this was done as a part of my internship at Greatify.

The following libraries and frameworks are demonstrated:

Huggingface Transformers: For model loading and training.
TRL: Reinforcement Learning utilities for language models.
PEFT: Efficient parameter fine-tuning.
LLaMAFactory: Streamlined LLaMA model fine-tuning.
Unsloth FastLanguageModel: Fast and memory-efficient training.

Additionally, the project covers:

Using vLLM to infer with saved LoRA adapters and respective models.
Hosting a server in Google Colab and accessing the model via API, utilizing cloudflared for tunneling.

Hands-on examples are provided in notebooks for experimentation.

Resources

I am also adding the resources that I used in order to learn it so that it can be beneficial for everyone!

HuggingFace Docs (Especially PEFT)
Huggingface API reference (Since some guides had outdated object args)
LoRA paper
Explanation by the creator of LoRA
vLLM Docs
Straightforward guide for multi LoRA inferencing with vLLM

If you guys need any help or you got stuck do reach out to me!

Finally I want to thank my mentor at Greatify Santhosh for mentoring me and my pookie senior Kashiful Haque for helping me out by guiding me to right sources and also the Google Overlords to have given me access to the T4 GPU in 4 accounts 😉

I will make a blog and explain about the concepts here soon!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LoRA_Finetuning		LoRA_Finetuning
OpenAI		OpenAI
vLLM_Inferencing		vLLM_Inferencing
.gitignore		.gitignore
README.Md		README.Md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fine Tuning and Inferencing

Resources

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Fine Tuning and Inferencing

Resources

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages