Compute OS

Compute OS is the operational substrate for scientific software, not a hardware catalog.

Subsystems

  • Cluster: Slurm, Kubernetes, queues, and quota policy.
  • Network: InfiniBand (IB), RDMA, and NCCL communication tuning.
  • Accelerator: governance across GPU/NPU/DPU pools.
  • Compiler: Triton, TVM, and graph-level optimization.
  • Observability: reliability and platform visibility.

Objective

Provide a stable, scalable, and measurable compute operating layer for the Scientific Software Factory.

AI-HPC Organization · Contact: openaihpc@gmail.com