The Module 1 Capstone Project is your first major milestone toward certification.
By completing it, you’ll demonstrate that you can fine-tune an open-weights model end-to-end — from dataset preprocessing to model optimization — using frameworks like Hugging Face Transformers, PEFT, and Accelerate.
You’ll also practice evaluation, quantization, and reproducibility, essential skills for real-world model development.
In this lesson, you will review the objectives, deliverables, and evaluation criteria for your first project.
This lesson outlines what you'll need to complete for your Module 1 Capstone Project — the main deliverable for this part of the certification.
As you go through these requirements, don't worry if some parts feel unfamiliar. You're not expected to know how to do these steps yet.
The lessons in Weeks 1 through 5 of this module will teach you the knowledge and skills needed to complete this project successfully.
Use this page as a preview of what you'll be ready to build by the end of Module 1.
A single GPU is sufficient — you can complete this project using:
A single GPU with 16GB+ VRAM (e.g., RTX 3090, A40, A100 40GB)
Google Colab Pro+ (with GPU access)
Cloud GPU instances (RunPod, AWS EC2, etc.)
Multi-GPU training is optional — While distributed training (DDP, FSDP, DeepSpeed) is covered in Week 4, it's not required for this project. If you have access to multiple GPUs and want to demonstrate those skills, that's great! But don't feel you need to rent expensive multi-GPU instances just for this project.
The focus is on demonstrating core fine-tuning skills, not on scaling to the largest possible models.
Project Objectives
In this project, you'll demonstrate your ability to fine-tune an open-weights LLM end-to-end by:
Preparing a dataset for instruction fine-tuning
Establishing a baseline by evaluating the base model
Fine-tuning the model using LoRA or QLoRA (parameter-efficient methods)
Evaluating the results to show improvement and check for catastrophic forgetting
Publishing your model with proper documentation
You'll work with a small to medium-sized model (e.g., Mistral 7B, Phi-3 Mini, Qwen-1.5B, or Llama 3.2 1B-3B) and adapt it for a specific task of your choice.
Throughout, the emphasis is on:
Real evaluation that demonstrates actual improvement
Professional documentation and publishing practices
Cost-effective training (single GPU is sufficient; multi-GPU is optional)
What You'll Build
You'll build and document a fine-tuned model that adapts an open-weights base LLM for a specific task or dataset.
Essential Requirements (Must Complete)
These core components demonstrate the fundamental skills you've learned:
1. Dataset Selection and Preparation
Choose a publicly available dataset from Hugging Face Datasets (or similar platform)
Format your dataset appropriately for instruction fine-tuning (instruction-response pairs or chat format)
Split into train/validation sets
Document your dataset choice and any preprocessing steps
Note: You don't need to collect or create your own dataset — using existing public datasets is perfectly acceptable
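As a minimal sketch of this step, the snippet below formats question/answer rows into the chat-style message format most fine-tuning frameworks accept and carves off a validation split. It uses plain Python with a toy, hypothetical dataset; in a real project you would load a public dataset with the `datasets` library instead.

```python
import random

# Toy stand-in for rows pulled from a public dataset (hypothetical data).
raw_rows = [
    {"question": "What is 2 + 2?", "answer": "4"},
    {"question": "Capital of France?", "answer": "Paris"},
    {"question": "Boiling point of water in C?", "answer": "100"},
    {"question": "Largest planet?", "answer": "Jupiter"},
]

def to_chat_format(row):
    """Convert a question/answer row into a chat-style message list."""
    return {
        "messages": [
            {"role": "user", "content": row["question"]},
            {"role": "assistant", "content": row["answer"]},
        ]
    }

def train_val_split(rows, val_fraction=0.25, seed=42):
    """Shuffle deterministically, then carve off a validation slice."""
    rows = rows[:]
    random.Random(seed).shuffle(rows)
    n_val = max(1, int(len(rows) * val_fraction))
    return rows[n_val:], rows[:n_val]

formatted = [to_chat_format(r) for r in raw_rows]
train_set, val_set = train_val_split(formatted)
print(len(train_set), len(val_set))  # 3 1
```

Keeping the split deterministic (fixed seed) is what makes your results reproducible later.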
2. Baseline Evaluation
Evaluate your base model (before fine-tuning) on your chosen task
Use appropriate metrics for your task (e.g., ROUGE for summarization, accuracy for classification, exact match for structured outputs)
Document baseline performance — this becomes your reference point
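For tasks scored by exact match, the baseline computation can be as simple as the sketch below. The predictions and references are hypothetical hard-coded strings; in practice they would come from running your base model over the validation set.

```python
def normalize(text):
    """Light normalization: lowercase and collapse whitespace."""
    return " ".join(text.lower().strip().split())

def exact_match(predictions, references):
    """Fraction of predictions matching the reference after normalization."""
    hits = sum(
        normalize(p) == normalize(r) for p, r in zip(predictions, references)
    )
    return hits / len(references)

# Hypothetical base-model outputs vs. gold answers.
preds = ["Paris", "5", " jupiter "]
refs = ["Paris", "4", "Jupiter"]
baseline_score = exact_match(preds, refs)
print(f"Baseline exact match: {baseline_score:.2f}")  # 0.67
```

For generation tasks like summarization you would swap this metric for ROUGE (e.g., via the `evaluate` library), but the before/after comparison logic stays the same.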
3. Fine-Tuning Implementation
Fine-tune using LoRA or QLoRA (parameter-efficient fine-tuning)
Use Hugging Face Transformers with PEFT, or Axolotl (your choice)
Apply 4-bit quantization (QLoRA) if needed to fit your model on available hardware
Document your training configuration (LoRA rank, learning rate, batch size, etc.)
Show training progress (loss curves, validation metrics)
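If you go the Axolotl route, the whole training configuration above lives in one YAML file. The sketch below follows Axolotl's field-name conventions; the base model, rank, and learning-rate values are illustrative placeholders, not recommendations:

```yaml
# Hypothetical Axolotl-style QLoRA config — values are illustrative only
base_model: mistralai/Mistral-7B-v0.1
load_in_4bit: true          # QLoRA: 4-bit quantized base weights
adapter: qlora
lora_r: 16                  # LoRA rank
lora_alpha: 32
lora_dropout: 0.05
lora_target_modules:
  - q_proj
  - v_proj
learning_rate: 0.0002
micro_batch_size: 2
gradient_accumulation_steps: 8
num_epochs: 3
val_set_size: 0.05
output_dir: ./outputs/qlora-run
```

Committing this file to your repository doubles as documentation of your training configuration.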
4. Post-Fine-Tuning Evaluation
Evaluate your fine-tuned model using the same metrics as your baseline
Compare before/after performance to demonstrate improvement
Include at least one general benchmark (e.g., MMLU subset, HellaSwag, or GSM8K) to check for catastrophic forgetting
Document all evaluation results clearly
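A simple way to organize this comparison is a per-metric delta report that flags regressions on general benchmarks. All numbers below are hypothetical, and the 5-point tolerance is an arbitrary illustrative threshold:

```python
# Hypothetical scores: task metric plus one general benchmark.
baseline = {"task_exact_match": 0.41, "hellaswag_acc": 0.58}
finetuned = {"task_exact_match": 0.67, "hellaswag_acc": 0.55}

def report(before, after, forgetting_tolerance=0.05):
    """Print per-metric deltas; warn if a general benchmark drops by
    more than the tolerance (in absolute points)."""
    for name in before:
        delta = after[name] - before[name]
        flag = ""
        if name != "task_exact_match" and delta < -forgetting_tolerance:
            flag = "  <-- possible catastrophic forgetting"
        print(f"{name}: {before[name]:.2f} -> {after[name]:.2f} ({delta:+.2f}){flag}")

report(baseline, finetuned)
```

A small dip on general benchmarks is common after task-specific fine-tuning; what you want to rule out is a large collapse.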
5. Experiment Tracking
Log at least one training run using Weights & Biases (or similar tool)
Track hyperparameters, training metrics, and evaluation results
Include a link to your W&B project in your submission
6. Model Publishing
Publish your fine-tuned model (or LoRA adapters) to Hugging Face Hub
Include a complete model card (README.md) with:
Model description and intended use
Training data and procedure
Evaluation results (baseline vs fine-tuned)
Limitations and known issues
How to load and use the model
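As a rough template, a model card covering the required sections might be skeletoned as below. The repository name, numbers, and wording are placeholders to be replaced with your own:

```markdown
# my-org/mistral-7b-finance-qa-lora   <!-- hypothetical repo name -->

## Model description and intended use
LoRA adapters for Mistral-7B fine-tuned on financial QA. Intended for ...

## Training data and procedure
Dataset: ... | QLoRA, rank 16, lr 2e-4, 3 epochs on a single 24 GB GPU.

## Evaluation results
| Metric      | Base | Fine-tuned |
|-------------|------|------------|
| Exact match | 0.41 | 0.67       |

## Limitations and known issues
Not suitable for ... May produce incorrect figures for ...

## How to load and use
(Include a working code snippet for loading the adapters with PEFT.)
```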
7. Reproducible Code
Provide clean, well-documented code (notebooks or scripts)
Include a requirements.txt or environment setup instructions
Ensure someone else can reproduce your results by following your code
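A minimal `requirements.txt` for this stack might look like the sketch below. The package list reflects the tools named in this project; pin the exact versions you actually used rather than copying these illustrative ones:

```text
# Illustrative requirements.txt — pin the versions from your own environment
transformers==4.44.0
peft==0.12.0
datasets==2.20.0
accelerate==0.33.0
bitsandbytes==0.43.1
wandb==0.17.5
evaluate==0.4.2
```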
Optional Enhancements (Bonus Points)
These demonstrate advanced skills but are not required:
Multi-GPU Training: If you have access to multiple GPUs, demonstrate DDP, FSDP, or DeepSpeed ZeRO
Hyperparameter Tuning: Run multiple experiments comparing different LoRA ranks, learning rates, or target modules
Advanced Evaluation: Build a custom evaluation suite with operational checks (length constraints, format validation, etc.)
Model Merging: Merge LoRA adapters into base model and publish merged version
Additional Benchmarks: Run multiple general benchmarks to thoroughly assess capability retention
Example Project Ideas
Not sure what to fine-tune your model for? Here are a few directions to help you choose a dataset and define your project goal.
All of these can be done using existing public datasets available on Hugging Face Datasets.
You don’t need to collect or clean your own data — simply pick a relevant dataset and focus on fine-tuning, evaluation, and optimization.
Domain-Specific Instruction Tuning
Fine-tune your model to handle domain-specific queries using a dataset specialized in a particular field such as finance, healthcare, or legal text.
Example: fine-tune on financial QA or stock report datasets so the model can answer questions about companies or filings in a concise, factual way.
Objective: improve domain fluency and response accuracy compared to the base model.
Technical Assistant for Developers
Use a dataset of technical commands or documentation (e.g., Docker, SQL, Git, or Linux) to train your model to respond to natural-language queries.
Example: fine-tune a model that converts prompts like “create a new Docker container for Redis” into the right command syntax.
Objective: demonstrate task-specific instruction tuning and precise text generation.
Reasoning and Problem Solving
Work with benchmark datasets such as GSM8K (grade-school math problems) or ARC (AI2 Reasoning Challenge) to improve reasoning or step-by-step problem-solving ability.
Example: fine-tune your model on a subset of GSM8K to teach it how to reason through arithmetic problems.
Objective: evaluate the model’s reasoning capability before and after fine-tuning.
Explore Your Own Idea
You’re not limited to these examples — explore any dataset or task that interests you.
Just ensure the dataset is publicly available, appropriately licensed, and small enough to fine-tune efficiently.
If you’re unsure where to start, browse Hugging Face Datasets for inspiration — many datasets include sample scripts and documentation that make setup easy.
To be included in a given month’s review cycle, submit your project by one of the following dates:
🔴 January 05, 2026, 11:59 PM UTC
🔴 February 02, 2026, 11:59 PM UTC
🔴 March 02, 2026, 11:59 PM UTC
🔴 April 06, 2026, 11:59 PM UTC
🔴 May 04, 2026, 11:59 PM UTC
If you miss a listed date, your project will simply roll over to the next month’s review.
Reviews typically take about two weeks, during which you’ll receive feedback and, if needed, an opportunity to make improvements before final evaluation.
Plan ahead so you can complete your submission comfortably within your preferred review window.
Submission Checklist
To complete this project, submit the following deliverables:
1. Project Publication on Ready Tensor
Create a technical publication that documents your project:
Required Content:
Objective: What task are you fine-tuning for? Why did you choose this task?
Dataset: Which dataset did you use? How did you prepare it?
Methodology:
Base model selection
Fine-tuning approach (LoRA/QLoRA configuration)
Training setup (hardware, framework, hyperparameters)
Results:
Baseline vs fine-tuned performance comparison
Training curves (loss over time)
Evaluation metrics (task-specific + at least one general benchmark)
Discussion: What worked well? What challenges did you face?
Visual Elements:
Charts showing training loss curves
Tables comparing baseline vs fine-tuned metrics
Example inputs/outputs demonstrating improvement
The publication should meet at least 70% of the Technical Evaluation Rubric for technical publications.
3. Model Publication (Hugging Face Hub)
Model (or LoRA adapters) published to Hugging Face Hub
Complete model card (README.md) with all required sections
Evaluation results included in model card or as separate file
Model Card Must Include:
Model description and intended use
Training data information
Training procedure and hyperparameters
Evaluation results (baseline vs fine-tuned)
Limitations and known issues
Code example for loading and using the model
4. Experiment Tracking (Weights & Biases)
Required:
At least one training run logged to W&B
Link to your W&B project included in your GitHub README or publication
Should Include:
Hyperparameters
Training metrics (loss, learning rate schedule)
Evaluation metrics
System information (GPU type, memory usage)
What You'll Earn
Successfully completing this project earns you the LLM Fine-Tuning Specialist credential — recognizing your ability to fine-tune, evaluate, and publish large language models using parameter-efficient methods.
If you've also completed Module 2 and earned the LLM Deployment Engineer credential, you'll be awarded the LLM Engineering & Deployment Certification, representing full completion of the program.
Your Next Step
Once you’ve submitted your project, it will be reviewed by the evaluation team.
If it meets the certification standards, you’ll receive your credential — and if you’ve completed both modules, your full program certificate as well.
This marks your official recognition as a certified LLM Engineer, capable of taking models from fine-tuning to production.