Engineer Capability Assessment System

中文 README | English README

AI-powered system for evaluating engineering capabilities by analyzing GitHub and Gitee commits, code changes, and collaboration patterns using a six-dimensional evaluation framework.

Overview

This system uses LLM-powered analysis (Claude 4.5 Haiku) to evaluate software engineers based on actual code commits, diffs, and collaboration patterns from GitHub and Gitee repositories. Features a modern React + Antd dashboard and FastAPI backend with intelligent caching.

Usage

Quick Start: Full Context + Cached Evaluation (Best Approach! ⭐)

Evaluate contributors one at a time using full repository context with caching:

# Evaluate all contributors with caching
python full_context_cached_evaluator.py

# Or evaluate specific contributor
python full_context_cached_evaluator.py "contributor_name"

# Force re-evaluation (ignore cache)
python full_context_cached_evaluator.py "contributor_name" --force

Why This is Best:

  • Full repo context (~650k tokens) for accuracy
  • Evaluate ONE contributor at a time for flexibility
  • Caches each result to avoid re-evaluation
  • Add new contributors later without re-evaluating everyone
  • 📊 ~100-200k tokens per contributor (first time)
  • 💰 ~$0.005-0.01 per contributor (first time)
  • 🆓 FREE when using cache!

See BEST_APPROACH.md for full details.
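
For orientation, the per-contributor caching described above can be pictured as a small JSON file keyed by repository and contributor. The snippet below is a minimal sketch of that idea, assuming hypothetical paths and field names; it is not the evaluator's actual cache code:

# cache_sketch.py -- illustrative only; paths and field names are assumptions
import json
from pathlib import Path

CACHE_DIR = Path(".cache/evaluations")  # hypothetical cache location

def cached_evaluate(repo, contributor, evaluate_fn, force=False):
    """Return a cached evaluation if one exists, otherwise run evaluate_fn once and store it."""
    CACHE_DIR.mkdir(parents=True, exist_ok=True)
    cache_file = CACHE_DIR / f"{repo.replace('/', '_')}__{contributor}.json"
    if cache_file.exists() and not force:
        return json.loads(cache_file.read_text())   # cache hit: no LLM tokens spent
    result = evaluate_fn(repo, contributor)         # cache miss: one full-context LLM call
    cache_file.write_text(json.dumps(result, indent=2))
    return result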


Alternative: Full Repo Analysis (All at Once)

Evaluate ALL contributors in ONE API call:

python full_repo_evaluator.py
  • Uses ~650k tokens, costs ~$0.03
  • Evaluates all contributors at once
  • No caching (re-evaluates everything each time)

See FULL_REPO_ANALYSIS_EXPLAINED.md for details.


Alternative: Server + Dashboard

For API-based evaluation with caching:

1. Configure Ports (Optional)

Edit .env.local files to configure ports (defaults: evaluator=8000, webapp=3000):

# evaluator/.env.local
PORT=8000

# webapp/.env.local
PORT=3000

2. Start the Services

Development Mode (with auto-reload):

./start_dev.sh

Production Mode (optimized):

./start_production.sh

Evaluator Only:

./start_server.sh

3. Access the Application

  • Dashboard: http://localhost:3000 (or the port set in webapp/.env.local)
  • API server: http://localhost:8000 (or the port set in evaluator/.env.local)

Notes:

  • In development, the frontend (3000) and backend (8000) run on different origins. The CLI injects NEXT_PUBLIC_API_SERVER_URL=http://localhost:8000 so the dashboard calls the correct API.
  • In the published PyPI package, the dashboard is served by the backend at http://localhost:8000/ (same-origin), so leaving NEXT_PUBLIC_API_SERVER_URL unset is the intended setup.

4. Analyze Engineers

Via Dashboard:

  1. Enter a GitHub/Gitee repository URL
  2. Click “Analyze Repository”
  3. Select a contributor to evaluate
  4. View AI-powered evaluation results with scores and charts

Via API:

# Evaluate a contributor (full analysis of all commits)
curl -X POST "http://localhost:8000/api/evaluate/octocat/Hello-World/octocat"
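
The same endpoint can be called from Python; a minimal sketch using requests, assuming the default port and a JSON response body:

import requests

# POST /api/evaluate/{owner}/{repo}/{username}
resp = requests.post(
    "http://localhost:8000/api/evaluate/octocat/Hello-World/octocat",
    timeout=600,  # analyzing every commit can take a while
)
resp.raise_for_status()
print(resp.json())  # six-dimensional scores plus reasoning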

Moderate Mode (Diffs + Files):

# Per-contributor evaluation with file context
uv run python -m evaluator.example_moderate_evaluation

Evaluation Approaches Comparison

| Approach | Tokens | API Calls | Cost | Caching | Best For |
|---|---|---|---|---|---|
| Full Context + Cached | ~100-200k each | 1 per contributor | $0.005-0.01 each | ✅ Yes | Complete + Flexible |
| Full Repo Analysis | ~650k | 1 | $0.03 | ❌ No | One-time team evaluation |
| Moderate per-contributor | ~45k each | N | $0.005×N | ❌ No | Individual assessments |
| Conservative (diffs only) | ~3k each | N | $0.0002×N | ❌ No | Quick screening |

Recommended: Use Full Context + Cached for best accuracy and flexibility!

See BEST_APPROACH.md and TOKEN_USAGE_VISUAL_COMPARISON.md for detailed comparison.

Six-Dimensional Evaluation Framework

1. AI Model Full-Stack & Trade-off Capability (AI模型全栈与权衡能力)

Focus Areas: Innovation through research-production synergy, system optimization

  • Deep learning framework usage and optimization
  • Model selection and trade-off decisions
  • End-to-end AI system implementation

2. AI Native Architecture & Communication Design (AI原生架构与沟通设计)

Focus Areas: Production-level platform, innovation through research-production synergy

  • AI-first architecture design
  • API and interface design for AI systems
  • Documentation and communication patterns

3. Cloud Native & Constraint Engineering (云原生与约束工程化)

Focus Areas: Production-level platform, system optimization

  • Containerization and orchestration
  • Infrastructure as code
  • CI/CD pipeline implementation
  • Resource optimization and constraints

4. Open Source Collaboration & Requirements Translation (开源协作与需求转化)

Focus Areas: Open source co-development, innovation through research-production synergy

  • Open source contribution quality and frequency
  • Issue management and PR reviews
  • Requirements analysis and implementation

5. Intelligent Development & Human-Machine Collaboration (智能开发与人机协同)

Focus Areas: All specialties

  • AI-assisted development practices
  • Code generation and review
  • Automation and tooling

6. Engineering Leadership & System Trade-offs (工程领导与系统权衡)

Focus Areas: System optimization, production-level platform, innovation through research-production synergy

  • Technical decision making
  • System architecture trade-offs
  • Team collaboration and mentorship

Features

  • AI-Powered Analysis: Uses Claude 4.5 Haiku to analyze actual code changes and commit patterns
  • Dual Platform Support: Analyzes both GitHub and Gitee repositories
  • Smart Caching: Local storage system to minimize API calls and LLM token usage
  • Modern UI: React + Antd + Antd-x dashboard with radar charts and detailed breakdowns
  • Real Commit Analysis: Evaluates actual diffs, file changes, and code quality (not just keywords)

Input & Output

Input:

  • Repository URLs (GitHub/Gitee)
  • Contributor username

Output:

  • Six-dimensional scores (0-100 scale)
  • AI-generated reasoning and analysis
  • Commit statistics and code metrics
  • Visual radar chart and detailed breakdowns
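
For orientation only, one evaluation result can be pictured as a record along these lines; the field names below are hypothetical and may not match the actual response schema:

# Hypothetical result shape -- field names are illustrative, not the real schema
example_result = {
    "contributor": "octocat",
    "repository": "octocat/Hello-World",
    "scores": {                       # six dimensions, each 0-100
        "ai_full_stack": 72,
        "ai_architecture": 65,
        "cloud_native": 58,
        "open_source_collaboration": 80,
        "intelligent_development": 61,
        "engineering_leadership": 67,
    },
    "reasoning": "LLM-generated analysis for each dimension...",
    "commit_stats": {"commits": 42, "additions": 3100, "deletions": 950},
}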

Installation

1. Install Dependencies

This repo uses pyproject.toml and the uv toolchain:

# First-time setup (if `uv.lock` is not present in the repo)
uv lock

# Then sync dependencies (creates/updates .venv)
uv sync

# Quick start without lock (not reproducible):
# uv sync --no-lock

2. Configure Environment Variables

Create a .env.local file with your API keys:

# OpenRouter API key for LLM evaluation (required)
OPEN_ROUTER_KEY=sk-or-v1-your-key-here

# GitHub token (optional but strongly recommended)
# Without token: ~60 requests/hour; With token: 5,000 requests/hour
GITHUB_TOKEN=ghp_your-token-here

# Gitee token (optional but strongly recommended)
# Without token: low rate limits; With token: higher rate limits
GITEE_TOKEN=your-gitee-token-here

If you’re using fish, you can also set env vars like:

set -x OPEN_ROUTER_KEY sk-or-v1-your-key-here
set -x GITHUB_TOKEN ghp_your-token-here

See .env.example for reference.
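
To confirm the variables are actually visible to Python, here is a quick sanity check, assuming python-dotenv is installed; the backend's own loading logic may differ:

# check_env.py -- quick sanity check; python-dotenv usage here is an assumption
import os
from dotenv import load_dotenv

load_dotenv(".env.local")  # read key=value pairs into the process environment

for key in ("OPEN_ROUTER_KEY", "GITHUB_TOKEN", "GITEE_TOKEN"):
    print(f"{key}: {'set' if os.getenv(key) else 'missing'}")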

LLM Provider: OpenAI-compatible (matches build/test_qwen.py style)

Prefer using the interactive initializer (creates/updates .env.local and asks before overwriting):

uv run oscanner init

By default the backend uses OpenRouter (OPEN_ROUTER_KEY). If you use an OpenAI-compatible Chat Completions provider, configure:

OSCANNER_LLM_BASE_URL=https://api.siliconflow.cn/v1
OSCANNER_LLM_API_KEY=sk-your-key-here
OSCANNER_LLM_MODEL=qwen/qwen3-coder-flash

# Optional: comma-separated fallbacks
# OSCANNER_LLM_FALLBACK_MODELS=qwen/qwen3-coder-flash,another-model-id

Note: OSCANNER_LLM_BASE_URL auto-appends /chat/completions. If your provider has a non-standard path, set:

OSCANNER_LLM_CHAT_COMPLETIONS_URL=https://api.siliconflow.cn/v1/chat/completions
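
To illustrate what "OpenAI-compatible" means here, the sketch below makes one Chat Completions call with the standard openai Python client pointed at such a provider; it only shows the wire format and is not the backend's internal code:

import os
from openai import OpenAI

# Any Chat Completions-compatible provider works with the standard client
client = OpenAI(
    base_url=os.environ["OSCANNER_LLM_BASE_URL"],   # e.g. https://api.siliconflow.cn/v1
    api_key=os.environ["OSCANNER_LLM_API_KEY"],
)

resp = client.chat.completions.create(
    model=os.environ.get("OSCANNER_LLM_MODEL", "qwen/qwen3-coder-flash"),
    messages=[{"role": "user", "content": "Summarize this commit: fix login race condition"}],
)
print(resp.choices[0].message.content)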

CLI: Start Backend / Frontend / Both

  • Backend only (FastAPI, with reload):
uv run oscanner serve --reload
  • Frontend only (Next.js dashboard, from repo checkout):
uv run oscanner dashboard --install
  • One-command dev (backend + frontend):
uv run oscanner dev --reload --install

Note: the frontend lives in webapp/ and is not included in PyPI installs. If webapp/ is missing, run:

oscanner dashboard --print

to see instructions.

Project Structure

.
├── README.md                   # Chinese README (default)
├── README_en.md                # English README
├── pyproject.toml              # Python project metadata (uv/packaging)
├── evaluator/
│   ├── __init__.py
│   ├── core.py                # Main evaluation engine
│   ├── dimensions.py          # Six dimension evaluators
│   ├── plugins/*/scan/__init__.py    # Plugin-provided evaluators (self-contained)
│   ├── paths.py               # Data/cache path strategy (user-dir by default)
│   ├── collectors/
│   │   ├── __init__.py
│   │   ├── github.py          # GitHub API integration
│   │   └── gitee.py           # Gitee API integration
│   ├── analyzers/             # (Reserved for future use)
│   └── tools/                 # Repo extraction tools
├── oscanner/
│   ├── cli.py                 # CLI entrypoint (oscanner)
│   └── __main__.py            # python -m oscanner
└── webapp/                    # Optional dashboard (Next.js)

Contributing

We recommend contributing via Gitee’s auto-generated PR workflow. See CONTRIBUTING.md for details.

Quick Start:

  1. Create or select an issue on Gitee
  2. Develop directly on the main branch
  3. Reference the issue in your commit message: fix #issue_number or 关闭 #issue_number
  4. Push to trigger auto PR generation and issue linking

How It Works

1. Commit Collection

  • Fetches commits from GitHub/Gitee API
  • Retrieves detailed commit data including files changed and diffs
  • Caches data locally in a user directory by default (override via OSCANNER_DATA_DIR)
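
For reference, the underlying data is available from standard REST endpoints. Below is a minimal sketch of pulling one author's commits and their diffs from the GitHub API; the project's own collector (evaluator/collectors/github.py) may differ in detail:

import os
import requests

OWNER, REPO, AUTHOR = "octocat", "Hello-World", "octocat"
headers = {"Authorization": f"Bearer {os.environ['GITHUB_TOKEN']}"}  # token raises the rate limit

# List recent commits by a single author
commits = requests.get(
    f"https://api.github.com/repos/{OWNER}/{REPO}/commits",
    params={"author": AUTHOR, "per_page": 30},
    headers=headers,
).json()

# One commit's detail includes per-file additions/deletions and the diff patch
detail = requests.get(
    f"https://api.github.com/repos/{OWNER}/{REPO}/commits/{commits[0]['sha']}",
    headers=headers,
).json()
for f in detail["files"]:
    print(f["filename"], f["additions"], f["deletions"])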

2. LLM-Powered Analysis

The system sends commit data to Claude 4.5 Haiku with:

  • Commit messages and descriptions
  • File changes (additions/deletions by language)
  • Code diffs (patches)
  • Commit statistics and patterns
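
Conceptually, that information is flattened into one text block before it is sent to the model. A simplified sketch of such a step (the real prompt template is more elaborate):

def build_commit_summary(commits):
    """Collapse collected commit data into one text block for the LLM prompt (illustrative only)."""
    lines = []
    for c in commits:
        lines.append(f"commit {c['sha'][:7]}: {c['message']}")
        for f in c.get("files", []):
            lines.append(f"  {f['filename']} (+{f['additions']}/-{f['deletions']})")
            if f.get("patch"):
                lines.append("  patch:\n" + f["patch"])
    return "\n".join(lines)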

3. Evaluation Criteria

The LLM evaluates based on actual code evidence:

AI Full-Stack

  • ML framework usage (TensorFlow, PyTorch, etc.)
  • Model architecture implementations
  • Training and optimization code
  • Model deployment patterns

AI Architecture

  • API design quality
  • Service architecture patterns
  • Documentation quality
  • Integration and interface design

Cloud Native

  • Docker/Kubernetes configurations
  • CI/CD pipeline definitions
  • Infrastructure as Code
  • Cloud platform integration

Open Source Collaboration

  • Commit message clarity
  • Issue/PR references and linking
  • Code review participation
  • Refactoring and improvement quality

Intelligent Development

  • Test coverage and automation
  • Development tooling and scripts
  • Build configurations
  • AI-assisted development practices

Engineering Leadership

  • Performance optimizations
  • Security considerations
  • Best practice adoption
  • Architectural decision making

4. Scoring

  • Each dimension scored 0-100
  • Scores based on actual code analysis (not keywords)
  • LLM provides detailed reasoning for each evaluation
  • Caching ensures efficient token usage

API Endpoints

| Endpoint | Method | Description |
|---|---|---|
| / | GET | API information |
| /health | GET | Health check |
| /api/evaluate/{owner}/{repo}/{username} | POST | Evaluate engineer capabilities |
| /api/cache/stats | GET | View cache statistics |
| /api/cache/clear | DELETE | Clear all cached data |
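
Example calls against a locally running server (default port 8000; responses assumed to be JSON):

import requests

BASE = "http://localhost:8000"
print(requests.get(f"{BASE}/health").json())            # health check
print(requests.get(f"{BASE}/api/cache/stats").json())   # cache statistics
requests.delete(f"{BASE}/api/cache/clear")              # clear all cached evaluations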

License

MIT License