Pepsi-Challenge: LLM Model Evaluation

Overview

Pepsi-Challenge is a server application for testing and validating Stacklok fine-tuned models. It provides support for both NVIDIA GPUs (CUDA) and Apple Silicon (Metal) platforms.

Hardware Support

NVIDIA GPUs: CUDA 12.1
Apple Silicon: Metal Performance Shaders (MPS)

Getting Started

Prerequisites

Python 3.12
Node.js v22.x (for frontend)
NVIDIA GPU with CUDA 12.1 or Apple Silicon device

Installation

Clone the repository:

git clone https://github.com/stacklok/pepsi-challenge.git
cd pepsi-challenge

Set up the backend:

cd backend

# Create and activate Python virtual environment
python3.12 -m venv venv
source venv/bin/activate  # Unix/macOS
# or
.\venv\Scripts\activate  # Windows

# Install dependencies based on platform
# For MacOS:
pip install -r requirements-macos.txt
# For CUDA systems:
pip install -r requirements-cuda.txt

Configuration

Backend Environment Setup (backend/.env):

# GitHub OAuth
GITHUB_CLIENT_ID=your-github-client-id
GITHUB_CLIENT_SECRET=your-github-client-secret
GITHUB_CALLBACK_URL=https://your-domain.com/auth/callback

# Model Configuration
BASE_MODEL_NAME=Qwen/Qwen2.5-Coder-0.5B
FINETUNED_MODEL_NAME=stacklok/Qwen2.5-Coder-0.5B-codegate
FRONTEND_URL=https://your-domain.com

# Access Control
ALLOWED_USERS=admin1,admin2
ADMIN_USERS=admin1,admin2

# Development Settings
WATCHFILES_FORCE_POLLING=false
WATCHFILES_IGNORE_PATHS=*/unsloth_compiled_cache/*

Frontend Environment Setup (frontend/.env):

NEXT_PUBLIC_BACKEND_URL=http://localhost:5000
NEXT_PUBLIC_API_URL=http://localhost:5000

GitHub OAuth Setup

Create a new OAuth application at https://github.com/settings/developers
Configure the application:
- Homepage URL: https://your-domain.com
- Authorization callback URL: https://your-domain.com/auth/callback
Copy the client ID and secret to your backend .env file

Running the Application

Local Development

Start the Backend in Development Mode:

cd backend
# Enable hot-reload for development
uvicorn main:app --host 127.0.0.1 --port 5000 --reload

Start the Frontend in Development Mode:

cd frontend
# Install dependencies if not already done
npm ci
# Start development server
npm run dev

The application will be available at:

Frontend: http://localhost:3000
Backend API: http://localhost:5000

For local development, ensure your .env files are configured with local URLs:

Backend (backend/.env):

FRONTEND_URL=http://localhost:3000
GITHUB_CALLBACK_URL=http://localhost:5000/auth/callback
LOCAL_ENV=True

Frontend (frontend/.env):

NEXT_PUBLIC_BACKEND_URL=http://localhost:5000
NEXT_PUBLIC_API_URL=http://localhost:5000

Production Deployment

We recommend building a Docker container for the frontend and backend, and then using that to run the application. Note that the backend uses CUDA, so you'll need a build machine that has CUDA and a GPU passed through. You'll also need to pass through a GPU to the backend container; the frontend container is just a normal-non-GPU container.

Start the Backend:

cd backend
uvicorn main:app --host 127.0.0.1 --port 5000

Start the Frontend:

cd frontend
npm install --omit=dev
npm run build
npm run start

Access Control

Regular Users: Configure allowed GitHub usernames in ALLOWED_USERS
Administrators: Configure admin GitHub usernames in ADMIN_USERS

These users will have access to the application and admin panel respectively.

Config Models

At the moment, the application supports running the comparison of the two models. These should be configured in the .env file.

FINETUNED_MODEL_NAME=stacklok/Qwen2.5-Coder-0.5B-codegate
BASE_MODEL_NAME=Qwen/Qwen2.5-Coder-0.5B

The application will then perform a comparison of the two models.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Pepsi-Challenge: LLM Model Evaluation

Overview

Hardware Support

Getting Started

Prerequisites

Installation

Configuration

GitHub OAuth Setup

Running the Application

Local Development

Production Deployment

Access Control

Config Models

Files

README.md

Latest commit

History

README.md

File metadata and controls

Pepsi-Challenge: LLM Model Evaluation

Overview

Hardware Support

Getting Started

Prerequisites

Installation

Configuration

GitHub OAuth Setup

Running the Application

Local Development

Production Deployment

Access Control

Config Models