DeepSeek R1 is a state-of-the-art AI model excelling in math, coding, and logical reasoning tasks. Running it locally on a Linux VM ensures privacy, reduces costs, and avoids cloud latency. This guide walks you through selecting the right model, installing it, and integrating it via API—even if you’re new to AI!
DeepSeek R1 offers distilled models optimized for different hardware:
| Model | VRAM Requirement | Use Case |
| --- | --- | --- |
| DeepSeek-R1-Distill-Qwen-1.5B | ~3.5 GB | Lightweight tasks, low-resource VMs |
| DeepSeek-R1-Distill-Qwen-7B | ~16 GB | Balanced performance (recommended for most users) |
| DeepSeek-R1-Distill-Llama-70B | ~161 GB | High-end tasks requiring multi-GPU setups |
For beginners: start with the 7B model (4.7GB download) for a balance of speed and capability.
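Not sure what your VM can handle? One quick way to check available RAM and VRAM before picking a model (the second command assumes an NVIDIA GPU with drivers installed):

```bash
free -h                                                        # total/available system RAM
nvidia-smi --query-gpu=memory.total,memory.free --format=csv   # VRAM (NVIDIA GPUs only)
```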
First, update your package index and install the basic prerequisites:

```bash
sudo apt update && sudo apt install -y curl python3-pip
```
Ollama simplifies local AI model management. Install it via:
```bash
curl -fsSL https://ollama.com/install.sh | sh
```
Verify installation:
```bash
ollama --version   # should display "ollama version 0.5.7" or later
```
Pull the 7B model (adjust `7b` to `1.5b` or `70b` as needed):
```bash
ollama pull deepseek-r1:7b
```
Check installed models:
```bash
ollama list   # should list "deepseek-r1:7b"
```
Launch Ollama in server mode:
```bash
ollama serve
```
The API will run at `http://localhost:11434`.
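Note that on Linux the install script usually registers Ollama as a systemd service, so the API may already be running before you launch `ollama serve` manually. A quick sanity check (`/api/tags` lists installed models):

```bash
systemctl status ollama                # is the background service already up?
curl http://localhost:11434/api/tags   # should return a JSON list of local models
```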
Use `curl` or Python to send requests:
Example 1: Curl Request
```bash
curl http://localhost:11434/api/generate -d '{
  "model": "deepseek-r1:7b",
  "prompt": "Explain quantum computing in simple terms"
}'
```
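By default, `/api/generate` streams the answer as a series of JSON objects, one chunk per line. If you want a single JSON response instead (easier for scripting), set the API's `stream` field to false:

```bash
curl http://localhost:11434/api/generate -d '{
  "model": "deepseek-r1:7b",
  "prompt": "Explain quantum computing in simple terms",
  "stream": false
}'
```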
Example 2: Python Integration
```python
import ollama

# Send a chat request to the local Ollama server (default: localhost:11434).
response = ollama.chat(
    model='deepseek-r1:7b',
    messages=[{'role': 'user', 'content': 'Write Python code for a Fibonacci sequence'}]
)
print(response['message']['content'])
```
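One assumption the snippet makes: the `ollama` Python client is a separate package from the server itself, so install it first:

```bash
pip3 install ollama   # official Python client for the local Ollama API
```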
A few tuning tips:

- For GPU acceleration, add `--gpu` when launching `ollama serve`.
- Cap response length with `max_tokens` and adjust creativity with `temperature` (0.7 recommended); see the request sketch after this list.
- If you run a browser front end on top of the API, it is typically served at `http://localhost:3000`.
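As a sketch of how those knobs map onto the API: Ollama accepts sampling parameters through an `options` field in the request body, where the token limit is named `num_predict` rather than `max_tokens`:

```bash
curl http://localhost:11434/api/generate -d '{
  "model": "deepseek-r1:7b",
  "prompt": "Summarize the theory of relativity in two sentences",
  "stream": false,
  "options": { "temperature": 0.7, "num_predict": 256 }
}'
```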
If a download fails or a model is not found, re-run `ollama pull` and check the model name for typos.

Running DeepSeek R1 on a Linux VM is straightforward with Ollama. The 7B model offers the best balance for beginners, while the API integration opens the door to AI-powered apps. Experiment with different prompts and explore its reasoning prowess; your privacy-focused AI journey starts now!
Frequently Asked Questions
**Should I start with the 1.5B or the 7B model?** The 1.5B model is ideal for basic tasks (e.g., text summarization, simple Q&A) on low-resource VMs (≤8GB RAM). The 7B model (recommended) handles complex reasoning, coding, and math problems better. If your VM has ≥16GB RAM and a mid-tier GPU, start with 7B for balanced performance.
**Can I run the 70B model on a personal VM?** The 70B model requires ~161GB of VRAM, which typically means enterprise-grade GPUs (e.g., 4x A100s). For personal VMs, stick to the 1.5B or 7B models. If you need 70B-level performance, consider cloud-based solutions like AWS/GCP.
**What if my VM runs out of memory?** Add swap space:

```bash
sudo fallocate -l 8G /swapfile && sudo chmod 600 /swapfile
sudo mkswap /swapfile && sudo swapon /swapfile
```
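Optionally, to keep the swap file active across reboots, register it in `/etc/fstab` (standard Linux practice, not Ollama-specific):

```bash
# Append a swap entry so the file is re-enabled at boot.
echo '/swapfile none swap sw 0 0' | sudo tee -a /etc/fstab
```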
Alternatively, pull a smaller model (e.g., `ollama pull deepseek-r1:1.5b`).

**Can I run DeepSeek R1 without a GPU?** Yes! Ollama runs models on CPU by default, but responses will be slower. For faster inference on CPU-only VMs, use a quantized variant (e.g., `ollama pull deepseek-r1:7b-q4_0`).
**How do I switch to a different model?** Stop the server (`sudo systemctl stop ollama`), pull the new model, and restart, as sketched below.
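A concrete sketch of that sequence, assuming the installer set up the `ollama` systemd service (the `1.5b` tag is just an example):

```bash
sudo systemctl stop ollama     # stop the running server
ollama pull deepseek-r1:1.5b   # fetch the model you are switching to
sudo systemctl start ollama    # bring the API back up
```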