Skip to content

Network Subsystem

Focus Areas

  • RDMA topology and congestion control
  • NCCL collective communication performance
  • Inter-zone routing and resilience strategy

Suggested Metrics

  • AllReduce throughput
  • End-to-end latency
  • Packet loss and retransmission rate

AI-HPC Organization · Contact: openaihpc@gmail.com