Toggle navigation
Jackson Kek
Home
About
Archive
Archive
Show All
9
deep learning
8
paper-reading
7
transformers
5
LLM
2
efficient_llms
2
Github
1
Web
1
diffusion-model
1
distributed training
1
pytorch
1
self-supervised learning
1
text-to-image
1
2024
Self-Rewarding Language Models
Fine-tune Llama 2 70B that outperforms GPT-4 0613...
2023
Road to Efficient LLMs 2: QLoRA
QLoRA: Efficient Finetuning of Quantized LLMs
LongLoRA: Efficient Fine tuning of Long Context Large Language Models
100k context length Llama 7B....How?
Road to Efficient LLMs 1: LoRA
Low-Rank Adaptation of Large Language Models
Getting Started with Distributed Data Parallel in PyTorch: A Beginner's Guide
Learn Multi GPU Training with DDP: Step by Step Tutorial and Tips for Deep Learning Scaling
FlashAttention: Fast and Memory Efficient Exact Attention with IO Awareness
Speed up transformers...from hardware pespective?
Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
A new approach to learns and reason more like human?
Break-A-Scene: Extracting Multiple Concepts from a Single Image
First Work That Attempt to Learn Multiple Concepts from a Single Image
How to Craft a Website using Jekyll and GitHub Pages
A Beginner's Guide to Building and Hosting Your Own Website