HELIX Research and Benchmark Materials

This page consolidates investor and technical diligence artifacts, including measured benchmark outputs, reproducibility notes, checkpoint coverage, and direct paper access.

Current Paper

Read the latest arXiv preprint and contact us for full patent portfolio review packets.

| Benchmark | Temperature | Metric   | Value  | Steering Rate | Source Artifact |
|-----------|-------------|----------|--------|---------------|-----------------|
| GSM8K     | 1.0         | Accuracy | 91.80% | 1.72%         | ICML_Experiments/results/gsm8k_Cortex_(4-bit_Granite)_t1.0_final.json |
| GSM8K     | 3.0         | Accuracy | 88.84% | 2.44%         | ICML_Experiments/results/gsm8k_Cortex_(4-bit_Granite)_t3.0_final.json |
| MMLU      | 1.0         | Accuracy | 74.30% | 0.02%         | ICML_Experiments/results/mmlu_Cortex_(4-bit_Granite)_t1.0_final.json |
| MMLU      | 3.0         | Accuracy | 72.49% | 0.40%         | ICML_Experiments/results/mmlu_Cortex_(4-bit_Granite)_t3.0_final.json |
| HumanEval | 1.0         | Pass@1   | 81.10% | 0.42%         | ICML_Experiments/results/humaneval_Cortex_(4-bit_Granite)_t1.0_final.json |
| HumanEval | 2.5         | Pass@1   | 80.49% | 0.33%         | ICML_Experiments/results/humaneval_Cortex_(4-bit_Granite)_t2.5_final.json |
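
Each Source Artifact path above appears to encode its benchmark and sampling temperature in the file name. The following is a minimal sketch of how a reviewer might recover both values from a path and cross-check them against the Benchmark and Temperature columns. The regular expression is inferred only from the six paths listed here and is an assumption, not a documented naming scheme; `parse_artifact_path` is a hypothetical helper name.

```python
import re
from pathlib import PurePosixPath

# Pattern inferred from the six listed paths (an assumption, not a spec):
#   <benchmark>_Cortex_(4-bit_Granite)_t<temperature>_final.json
ARTIFACT_RE = re.compile(
    r"^(?P<benchmark>[a-z0-9]+)_Cortex_\(4-bit_Granite\)"
    r"_t(?P<temperature>\d+(?:\.\d+)?)_final\.json$"
)


def parse_artifact_path(path: str) -> tuple[str, float]:
    """Return (benchmark, temperature) parsed from an artifact path.

    Hypothetical helper: raises ValueError when the file name does not
    follow the inferred pattern, rather than guessing.
    """
    name = PurePosixPath(path).name  # drop the directory part
    match = ARTIFACT_RE.match(name)
    if match is None:
        raise ValueError(f"unrecognized artifact name: {name!r}")
    return match["benchmark"], float(match["temperature"])


if __name__ == "__main__":
    example = (
        "ICML_Experiments/results/"
        "gsm8k_Cortex_(4-bit_Granite)_t1.0_final.json"
    )
    print(parse_artifact_path(example))
```

A mismatch between the parsed values and the table row would be a quick signal that a row points at the wrong artifact.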

Artifact Coverage

  • Final JSON result files: 43
  • Checkpoint JSON files: 473
  • Benchmark sets represented: GSM8K, MMLU, HumanEval, TruthfulQA

Data Integrity Notes

  • Values shown above are from measured `summary` fields in final artifacts.
  • Telemetry coverage in sampled finals is 1.0 (100%).
  • Full raw files are available via diligence package on request.
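
For reviewers who receive the raw files, the sketch below shows one way to pull the `summary` field out of a final artifact so it can be compared with the published values. It assumes `summary` sits at the top level of the JSON object; the layout of the field and the names of its keys are not specified on this page, so the code returns the field unchanged instead of assuming any key. `read_summary` is a hypothetical helper name.

```python
import json
from pathlib import Path


def read_summary(path):
    """Load a final JSON artifact and return its `summary` field.

    Assumption: `summary` is a top-level key. The keys inside it are
    not documented on this page, so they are returned as-is.
    """
    with Path(path).open(encoding="utf-8") as fh:
        artifact = json.load(fh)
    if not isinstance(artifact, dict) or "summary" not in artifact:
        raise KeyError(f"no top-level 'summary' field in {path}")
    return artifact["summary"]
```

Calling `read_summary` on each final file and printing the result is enough to compare the stored numbers against the table by eye; an automated comparison would first need the actual key names from the diligence package.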