
Commit d750434

pa4 more edits
1 parent cf9bb61 commit d750434

1 file changed: +2 -2 lines changed

_pages/dat450/assignment4.md

Lines changed: 2 additions & 2 deletions
@@ -1,13 +1,13 @@
 ---
 layout: page
-title: 'DAT450/DIT247: Programming Assignment 4: Supervised Fine-Tuning (SFT) a Small LLM with LoRA'
+title: 'DAT450/DIT247: Programming Assignment 4: Supervised Fine-Tuning (SFT) with LoRA'
 permalink: /courses/dat450/assignment4/
 description:
 nav: false
 nav_order: 4
 ---

-# DAT450/DIT247: Programming Assignment 4: Supervised Fine-Tuning (SFT) a Small LLM with LoRA
+# DAT450/DIT247: Programming Assignment 4: Supervised Fine-Tuning (SFT) with LoRA

 In this assignment, you will perform supervised fine-tuning (SFT) of a small open LLM (preferably [OLMo-2 1B](https://huggingface.co/allenai/OLMo-2-0425-1B)) on [Alpaca](https://huggingface.co/datasets/tatsu-lab/alpaca), a dataset of 52k instructions generated by OpenAI's text-davinci-003 engine. You will convert this dataset into instruction-response pairs, fine-tune a causal language model using LoRA (Low-Rank Adaptation), and evaluate it through prompted inference and comparison with other methods.

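Note on the context paragraph: the step "convert this dataset into instruction-response pairs" could look roughly like the sketch below. It assumes the Alpaca record fields `instruction`, `input`, and `output`. The prompt templates and the helper name `to_pair` are hypothetical and are not taken from the assignment text, which may prescribe a different format.

```python
# Hypothetical templates: the assignment may specify a different prompt format.
PROMPT_WITH_INPUT = (
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:\n"
)
PROMPT_NO_INPUT = "### Instruction:\n{instruction}\n\n### Response:\n"


def to_pair(record):
    """Turn one Alpaca-style record into a (prompt, response) pair."""
    instruction = record["instruction"].strip()
    extra = record.get("input", "").strip()
    if extra:
        # Records with extra context get an additional "Input" section.
        prompt = PROMPT_WITH_INPUT.format(instruction=instruction, input=extra)
    else:
        prompt = PROMPT_NO_INPUT.format(instruction=instruction)
    return prompt, record["output"].strip()


# Made-up record for illustration, not taken from the dataset.
example = {
    "instruction": "Name the capital of Sweden.",
    "input": "",
    "output": "Stockholm.",
}
prompt, response = to_pair(example)
```

Keeping the prompt and the response separate makes it possible to compute the training loss on the response tokens only, which is one common choice for SFT. Whether the assignment expects that is not stated in this diff.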