CCL-Bench is a collaborative benchmarking project studying how distributed ML workloads behave across different hardware, frameworks, and communication libraries. As models scale, a distributed implementation on hardware A with library X can behave drastically differently from one on hardware B with library Y. Each group collects traces and contributes analysis tools, which are then applied across groups, making the benchmark scalable without brute-force exploration of every configuration. Metrics include MFU, estimated memory bandwidth, step time, and more.
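
Of these metrics, MFU (Model FLOPs Utilization) is the least self-explanatory. Below is a minimal sketch of how it is commonly estimated, assuming the usual ~6 FLOPs-per-parameter-per-token approximation for a forward + backward pass; the exact accounting used by CCL-Bench's analysis tools may differ:

```python
def estimate_mfu(num_params: float, tokens_per_step: float,
                 step_time_s: float, peak_flops_per_s: float) -> float:
    """Rough MFU: achieved FLOP/s divided by the hardware's peak FLOP/s.

    Uses the common ~6 FLOPs-per-parameter-per-token approximation for a
    forward + backward pass; a more exact count would include attention
    FLOPs and other terms.
    """
    achieved = 6.0 * num_params * tokens_per_step / step_time_s
    return achieved / peak_flops_per_s

# Hypothetical numbers: a 7B-parameter model, 8 sequences of 2048 tokens per
# step, 1.7 s per step, on an accelerator with ~1 PFLOP/s of BF16 peak.
print(f"MFU ~= {estimate_mfu(7e9, 8 * 2048, 1.7, 1e15):.0%}")  # -> MFU ~= 40%
```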

Part of the Cornell sysphotonics research group  ·  Maintainers: Eric Ding, Kaiwen Guo, Jelena Gvero, Byungsoo Oh, Bhaskar Kataria, Atharv Sonwane, Rachee Singh  ·  Contact: Eric Ding  ·  Contributing: GitHub

Upload Trace & Workload Card

The upload form walks through five steps: ① Contributor, ② Model & Data, ③ Hardware, ④ Framework, and ⑤ Files & Upload. Required fields: workload name, contributor name, a valid email, model family, phase, batch size, sequence length, hardware type, hardware model, total hardware count, and framework.
Trace files can be dragged & dropped or selected from disk. Accepted formats: .nsys-rep, .json, .tar, .gz, .tgz, .zip, .bz2, .xz, .zst  ·  Max 500 MB
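
A minimal sketch of the kind of client-side check these constraints imply; the helper name, the MiB reading of the 500 MB cap, and the logic itself are assumptions, not the site's actual validation code:

```python
import pathlib

ACCEPTED_SUFFIXES = {".nsys-rep", ".json", ".tar", ".gz", ".tgz",
                     ".zip", ".bz2", ".xz", ".zst"}
MAX_BYTES = 500 * 1024**2  # "500 MB" read as 500 MiB; the server may differ

def is_uploadable(path: pathlib.Path) -> bool:
    # Accept a trace if its extension is on the list and it fits the size cap.
    return path.suffix in ACCEPTED_SUFFIXES and path.stat().st_size <= MAX_BYTES
```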
What happens on upload:
① Your trace files are saved to the server
② A workload_card.yaml is auto-generated from the form fields above (a sketch of this step follows the list)
③ Both are stored under uploads/<workload-name>/
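
A hedged sketch of step ②, assuming the generated card simply mirrors the form fields; the actual schema and key names are defined server-side and may differ:

```python
import pathlib
import yaml  # PyYAML

# Hypothetical form payload; keys mirror the required fields of the form.
form = {
    "workload_name": "llama-8b-pretrain-h100",
    "contributor": {"name": "Jane Doe", "email": "jd@example.edu"},
    "model": {"family": "llama", "phase": "pretraining",
              "batch_size": 8, "sequence_length": 2048},
    "hardware": {"type": "GPU", "model": "H100", "total_count": 64},
    "framework": "pytorch",
}

# Step ③: the card is stored alongside the uploaded traces.
dest = pathlib.Path("uploads") / form["workload_name"]
dest.mkdir(parents=True, exist_ok=True)
(dest / "workload_card.yaml").write_text(yaml.safe_dump(form, sort_keys=False))
```

Keeping the card as plain YAML next to the traces means every workload stays self-describing, so cross-group analysis tools can discover inputs by walking uploads/ without a database.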