https://github.com/cebor/ollama_models
Mirrored from:
https://github.com/cebor/ollama_models
Last synced: about 1 month ago
JSON representation
Mirrored from:
- Host: GitHub
- URL: https://github.com/cebor/ollama_models
- Owner: cebor
- License: mit
- Created: 2026-04-11T04:41:44.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2026-05-04T21:55:23.000Z (about 2 months ago)
- Last Synced: 2026-05-04T23:37:35.859Z (about 2 months ago)
- Language: PowerShell
- Homepage: https://gitlab.stkn.org/felix/ollama_models
- Size: 15.6 KB
- Stars: 1
- Watchers: 0
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# ollama-modelfiles
Ollama Modelfile collection for local AI coding workflows, optimized for two machines.
## Hardware
| Machine | Chip | RAM | VRAM |
|---------|------|-----|------|
| MacBook | Apple M3 | 24 GB Unified Memory | — |
| PC | NVIDIA RTX 5090 | 64 GB RAM | 32 GB VRAM |
## Models
| Model | Type | Size (Q4_K_M) | Mac ctx | PC ctx |
|-------|------|---------------|---------|--------|
| gemma4:26b-a4b-it-q4_K_M | MoE | ~18 GB | 16384 | 131072 |
| gemma4:31b-it-q4_K_M | Dense | ~21 GB | 8192 | 65536 |
| qwen3.6:27b-q4_K_M | Dense | ~17 GB | 8192 | 65536 |
| qwen3.6:35b-a3b-q4_K_M | MoE | ~24 GB | — | 131072|
## Setup
### Mac Setup
```bash
# Create custom configs
ollama create gemma4-26b-coding -f ./mac-m3-24gb/gemma4-26b-a4b-it-q4_K_M.txt
ollama create gemma4-31b-planning -f ./mac-m3-24gb/gemma4-31b-it-q4_K_M.txt
ollama create qwen3.6-27b-planning -f ./mac-m3-24gb/qwen3.6-27b-q4_K_M.txt
ollama list
```
### PC Setup
```bash
# Create custom configs
ollama create gemma4-26b-coding -f ./pc-rtx5090-32gb/gemma4-26b-a4b-it-q4_K_M.txt
ollama create gemma4-31b-planning -f ./pc-rtx5090-32gb/gemma4-31b-it-q4_K_M.txt
ollama create qwen3.6-27b-planning -f ./pc-rtx5090-32gb/qwen3.6-27b-q4_K_M.txt
ollama create qwen3.6-35b-coding -f ./pc-rtx5090-32gb/qwen3.6-35b-a3b-q4_K_M.txt
ollama list
```
## Repository Structure
```
ollama-modelfiles/
├── README.md
├── mac-m3-24gb/
│ ├── gemma4-26b-a4b-it-q4_K_M.txt
│ ├── gemma4-31b-it-q4_K_M.txt
│ └── qwen3.6-27b-q4_K_M.txt
├── pc-rtx5090-32gb/
│ ├── gemma4-26b-a4b-it-q4_K_M.txt
│ ├── gemma4-31b-it-q4_K_M.txt
│ ├── qwen3.6-27b-q4_K_M.txt
│ └── qwen3.6-35b-a3b-q4_K_M.txt
└── scripts/
└── ollama-network-expose.ps1
```
## Parameters
| Parameter | Value | Description |
|-----------|-------|-------------|
| `num_ctx` | varies by model/hardware | Maximum context length in tokens |
| `temperature` | 0.2 (coding) / 0.5 (planning) | Coding = deterministic output; Planning = more creative reasoning |
| `repeat_penalty` | 1.1 | Prevents repetitive outputs |
## VS Code Copilot Chat
After setup, select the desired model in Copilot Chat:
- `gemma4-26b-coding`
- `gemma4-31b-planning`
- `qwen3.6-35b-coding` *(PC only)*
- `qwen3.6-27b-planning`
## Scripts
| Script | Platform | Description |
|--------|----------|-------------|
| `scripts/ollama-network-expose.ps1` | Windows (PowerShell) | Exposes the Ollama API on the local network |
```powershell
# Run from repo root in an elevated PowerShell session
powershell -ExecutionPolicy Bypass -File .\scripts\ollama-network-expose.ps1
```