Jackson Kek

Archive

Tags: deep learning (8), paper-reading (7), transformers (5), LLM (2), efficient_llms (2), Github (1), Web (1), diffusion-model (1), distributed training (1), pytorch (1), self-supervised learning (1), text-to-image (1)

2024

Self-Rewarding Language Models

Fine-tuning Llama 2 70B to outperform GPT-4 0613...

2023

Road to Efficient LLMs 2: QLoRA

QLoRA: Efficient Finetuning of Quantized LLMs

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

100k context length on Llama 7B... How?

Road to Efficient LLMs 1: LoRA

Low-Rank Adaptation of Large Language Models

Getting Started with Distributed Data Parallel in PyTorch: A Beginner's Guide

Learn Multi-GPU Training with DDP: Step-by-Step Tutorial and Tips for Scaling Deep Learning

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Speed up transformers... from a hardware perspective?

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

A new approach to learn and reason more like humans?

Break-A-Scene: Extracting Multiple Concepts from a Single Image

First Work That Attempts to Learn Multiple Concepts from a Single Image

How to Craft a Website using Jekyll and GitHub Pages

A Beginner's Guide to Building and Hosting Your Own Website

Copyright © Jackson Kek 2024
Powered by Hux Blog