QLoRA is an efficient finetuning approach that quantizes a pretrained model to 4-bit precision and trains adapters on top of it. This lets people fine-tune large models on a single GPU: it's now possible to fine-tune a 33B parameter model in less than 24 GB of memory.
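To see why 4-bit storage makes the 24 GB figure plausible, here's a quick back-of-the-envelope sketch (illustrative only; it counts base weights and ignores LoRA adapters, activations, and optimizer state, which add some overhead on top):

```python
# Rough memory estimate for base model weights at different bit widths.
# Assumption: parameters stored densely at the given precision; real QLoRA
# (NF4 + double quantization) adds small per-block overhead not modeled here.

def base_weight_gb(n_params: int, bits: int) -> float:
    """Memory in GB needed to store n_params weights at `bits` precision."""
    return n_params * bits / 8 / 1e9

params_33b = 33_000_000_000
print(f"16-bit weights: {base_weight_gb(params_33b, 16):.1f} GB")  # ~66 GB
print(f" 4-bit weights: {base_weight_gb(params_33b, 4):.1f} GB")   # ~16.5 GB
```

At 4 bits the 33B base weights need roughly 16.5 GB, leaving headroom on a 24 GB card for adapters and activations, versus ~66 GB at 16-bit.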

#datascience #machinelearning #lora #peft #qlora #finetuning #largelanguagemodels

Paper: https://arxiv.org/abs/2305.14314
Code+Demo: https://github.com/artidoro/qlora
Samples: https://colab.research.google.com/drive/1kK6xasHiav9nhiRUJjPMZb4fAED4qRHb?usp=sharing
Colab: https://colab.research.google.com/drive/17XEqL1JcmVWjHkT-WczdYkJlNINacwG7?usp=sharing

Background by Vishnu Mohanan: https://unsplash.com/collections/1779288/lb—brain-dump
━━━━━━━━━━━━━━━━━━━━━━━━━
Rajistics Social Media »
● Link Tree: https://linktr.ee/rajistics
● LinkedIn: https://www.linkedin.com/in/rajistics/
━━━━━━━━━━━━━━━━━━━━━━━━━