Feeding GPU clusters at wire‑rate with RDMA + GPUDirect Storage to lift utilization and reduce cost.
NorthFS builds GPU‑native distributed file systems and a distributed KV cache so training and inference aren’t bottlenecked by storage. We remove slow TCP data paths, kernel copies, and object‑store latency with an RDMA‑first design and zero‑copy reads into GPU memory.
Modern GPU jobs frequently idle waiting for data. Typical paths introduce latency, tail stalls, and poor cache locality. At scale this burns budget and elongates training/inference SLAs.
NorthFS is an RDMA‑first, GDS‑enabled file system with async prefetching and admission control. We bypass kernel copies, stripe across NVMe, and read directly into GPU memory buffers.
Enterprise AI/ML platform teams and HPC orgs operating 10–1000+ GPUs in cloud, on‑prem, or hybrid environments. Drop‑in for training, inference, and data preprocessing pipelines.
RDMA + GDS optimized, POSIX‑style interface. Works across cloud or on‑prem clusters with connectors for S3/HDFS and Python/CLI SDKs.
Stage: Private Beta (design partners onboarding).
Low‑latency, multi‑node KV cache for serving attention KV blocks across GPUs with near‑local access. Built for vLLM/SGLang‑style inference.
Stage: Alpha (prototype under active development).
Python & CLI integration for quick adoption in training/inference pipelines with minimal code change.
Stage: Beta.
RDMA + GDS optimized file system purpose‑built for GPUs. Deployable across cloud, on‑prem, or hybrid.
Serve KV blocks across GPU nodes with near‑local latency. Works with frameworks like vLLM and SGLang.
Globally consistent metadata layer for coordination, leasing, and scale‑out.
Plug into training/inference pipelines with minimal code change.
Ex-Uber, Distributed systems & AI. Contributor to Apache Spark, Velox, Gluten, Pinot. Author of peer‑reviewed papers in systems, big data, and AI.
Ex-Uber, Distributed systems engineer; builds on Hive/Spark and cloud platforms. Focus on large‑scale data infra and reliability.
Interested in the private beta or design partnership?