LOOM-Eval
Getting Started
Installation
Basic Installation
Step 1: Create Environment
Step 2: Install LOOM-Eval
Step 3: Install Flash Attention
Acceleration Methods
General Acceleration Environment
KIVI Installation
ThinK Installation
FlexPrefill Installation
XAttention Installation
Other Acceleration Methods
RAG Installation
Next Steps
Quick Start
Prerequisites
Installation
Basic Usage
Automatic Evaluation (Recommended)
Manual Evaluation (Step-by-Step)
Key Parameters
Core Parameters
Advanced Parameters
WebUI Usage
Interactive Evaluation
Example Scenarios
Scenario 1: Quick Test with Limited Samples
Scenario 2: Multi-GPU Long Context Evaluation
Scenario 3: Using vLLM for Faster Inference
Scenario 4: API Interface Usage
Scenario 5: RAG-Enhanced Evaluation
Scenario 6: Acceleration for Memory Efficiency
Running LOOMBench Suite
Option 1: Run Full LOOMBench Suite
Common Issues
Output Structure
Next Steps
User Guide
Acceleration Methods
KV Cache Optimization
Sparse Attention
Usage
Performance (128K Context)
Model Compatibility
Installation Notes
Hardware Requirements
API Reference
Command Line Interface
Main Commands
Core Parameters
Inference Options
Acceleration Options
RAG Options
Generation & Logic Options
Data & Sampling Options
API Configuration Options
Execution Options
Extension Options
Storage Strategy Options
Custom Templates
Examples
Basic Evaluation (Automatic)
With KV Cache Acceleration
With RAG Enhancement
vLLM High-Throughput Backend
API Model (OpenAI/Anthropic)
Sparse Attention (NSA/MOBA)
Step-by-Step Manual Evaluation
RAG (Retrieval-Augmented Generation)
Supported Methods
Installation
Quick Start
Configuration
Command-line Options
Task Compatibility
Best Practices
LOOM-Eval
Search
Please activate JavaScript to enable the search functionality.