Personal Hardware Plan Project — Hardware Planning for Local AI (Chapter 20)

This final project synthesizes the course concepts into a concrete hardware plan for your specific situation.

Project Structure

Create a document containing:

Current situation assessment
Use case definition
Budget constraints
Hardware recommendation
Alternative configurations
Timeline and staged approach

Section 1: Current Situation Assessment

Document your starting point:

## Current Hardware

- GPU: [Model and VRAM]
- System RAM: [Amount and speed]
- Storage: [Type, capacity, speed]
- CPU: [Model]
- Budget ceiling: $[Amount]

## Current Pain Points

1. [Specific frustration with current setup]
2. [Specific model not usable]
3. [Performance unacceptable for X use case]

Section 2: Use Case Definition

Define specific workloads:

## Primary Use Cases

| Use Case | Model | Precision | Context | Concurrent Users |
|----------|-------|-----------|---------|-----------------|
| Coding assistant | Mistral 7B | INT4 | 16K | 1 |
| Document analysis | 13B | INT8 | 8K | 1 |
| Batch summarization | 70B-8B ensemble | INT4 | 4K | Background |

## Frequency

- Daily interactive use: ~4 hours
- Weekly batch processing: ~8 hours
- Monthly fine-tuning experiments: ~24 hours

Section 3: VRAM Calculation

Calculate minimum requirements:

## VRAM Calculation

Base model requirements (per Chapter 2):

| Model | FP16 (GB) | INT8 (GB) | INT4 (GB) |
|-------|-----------|-----------|-----------|
| Mistral 7B | 14 | 8 | 4 |
| Llama 3 13B | 26 | 14 | 7 |
| Llama 3 70B | 140 | 74 | 37 |

Context requirements: 1-4GB depending on length
Overhead factor: 1.25x

Recommended VRAM: 16GB (for 13B INT4 with headroom)

Section 4: Hardware Recommendation

## Recommended Configuration

### Option A: Mid-Range Build (~$1500)

| Component | Selection | Price |
|-----------|-----------|-------|
| GPU | RTX 4070 Ti 12GB | $700 |
| CPU | Ryzen 7 7700X | $250 |
| RAM | 32GB DDR5 | $100 |
| Storage | 1TB NVMe | $80 |
| PSU | 850W | $100 |
| Case | Mid-tower | $80 |
| **Total** | | **$1310** |

### Option B: High-Performance Build (~$2800)

[Full RTX 4090 build details]

### Option C: Hybrid Approach ([$600] eGPU + existing system)

[Details for adding eGPU to current setup]

Section 5: Alternative Configurations

Include at least three alternatives:

Cloud-first approach: Use existing hardware, pay $X/month for cloud A100
Used hardware route: RTX 3090 24GB used at $Y, with savings used for RAM
Apple Silicon path: MacBook Pro M3 Max for unified memory experience

Calculate break-even points:

## Cloud vs. Purchase Analysis

Cloud A100 40GB: $1.50/hr × 730 hrs (24/7) = $1,095/month
RTX 4090 24GB purchase: $1,700 + $50/month electricity = $2,300 one-time.

Break-even: At 1,533 hours/month use, purchased GPU becomes cheaper.
Current projection: 200 hours/month actual use.
Cloud remains cheaper for this usage pattern.

Section 6: Implementation Timeline

## Implementation Plan

### Phase 1 (Immediate): Cloud + existing hardware
- Maintain current setup
- Use Vast.ai spot instances for 70B model access
- Cost: ~$100/month

### Phase 2 (Month 3): Mid-range upgrade
- Purchase GPU and core components ($1,300)
- Decommission cloud (save $800/month)
- ROI: 1.6 months

### Phase 3 (Month 12): Evaluation
- Assess if current GPU meets growing needs
- Plan for RTX 5090 / next generation upgrade
- Revisit cloud usage patterns

Submission Checklist

Before finalizing your plan:

VRAM calculation completed with specific numbers
At least three configurations compared
Real prices researched (not estimated)
Break-even analysis for cloud vs. purchase
Timeline considers actual budget flow
Failure modes acknowledged and mitigated

My Hardware Plan for Local AI

Current Status

GPU: nvidia-smi --query-gpu=name,memory.total
RAM: [system info]
Primary models: [List of models you run regularly]

VRAM Requirements

Model	FP16 (GB)	INT8 (GB)	INT4 (GB)
[Your table here]

Recommended Build

[Your specific configuration]

Purchase Decision

Cloud [$X]/month vs. Purchase [$Y] one-time Break-even: [Calculated months]



## Course Summary

This course covered the essential hardware knowledge for running AI models locally. You learned that VRAM is the primary constraint on model selection, how to calculate precise VRAM requirements, and how to select GPUs across budget tiers. The course included practical guidance on supporting components (motherboard, PSU, cooling), alternative platforms (AMD ROCm, Apple Silicon), and complementary strategies (eGPU, cloud fallback). Multiple build configurations from entry-level to high-performance provided concrete examples. Armed with this knowledge, you can make informed hardware decisions that match your specific workload, budget, and timeline.

Remember: Hardware planning is not about purchasing the most expensive system—it is about purchasing the right system for your actual use case at the lowest necessary cost.

Write your complete hardware plan using the template above. Include real prices for components and calculate your break-even point for local versus cloud compute. Save this document for future reference.

Course Summary

This course covered the essential hardware knowledge for running AI models locally. You learned that VRAM is the primary constraint on model selection, how to calculate precise VRAM requirements, and how to select GPUs across budget tiers. The course included practical guidance on supporting components (motherboard, PSU, cooling), alternative platforms (AMD ROCm, Apple Silicon), and complementary strategies (eGPU, cloud fallback). Multiple build configurations from entry-level to high-performance provided concrete examples. Armed with this knowledge, you can make informed hardware decisions that match your specific workload, budget, and timeline.

Remember: Hardware planning is not about purchasing the most expensive system—it is about purchasing the right system for your actual use case at the lowest necessary cost.