Research
We pursue foundational questions in AI with rigor and openness. Every result we publish includes code, data, and the full experimental setup to reproduce it.
Areas
We focus where impact is highest and under-served: efficient training, hardware portability, multimodal systems, and alignment.
Architecture research focused on reducing compute per token without sacrificing downstream performance. We study sparse attention, mixture-of-experts routing, and training curriculum design.
Unified models that process images, text, and sensor data — deployable on edge hardware. We benchmark VLA models across standard robotics and embodied AI tasks.
Compiler-level abstractions that port models across accelerators with minimal throughput loss. Covers operator fusion, graph compilation, and mixed-precision strategies.
Feature visualization, probing classifiers, and mechanistic interpretability applied to frontier models. We also develop adversarial evaluation suites for alignment benchmarking.
Publications
All our work is published openly. Preprints appear on arXiv before formal peer review.
Our first papers are in preparation. Join the waitlist to be notified when we publish.
Model Coverage
Select a silicon type to see which model families run out of the box versus with our compatibility layer.
| # | Family | Model | Type | Stock Trainium | With Luma |
|---|---|---|---|---|---|
| 1 | Decoder LLM | distilgpt2 | Text-to-text | ✕ NaN loss | ✓ trains |
| 2 | Encoder | distilbert-base-uncased | Text encoder | ✓ trains | ✓ trains |
| 3 | Encoder-decoder | t5-small | Text-to-text | ✓ trains | ✓ trains |
| 4 | ViT | vit-base-patch16-224 | Image classification | ✓ trains | ✓ trains |
| 5 | CNN | resnet-18 | Image classification | ✓ trains | ✓ trains |
| 6 | Diffusion UNet | ddpm-cifar10-32 | Image generation | ✕ no converge | ✓ trains |
| 7 | STT | whisper-tiny | Speech-to-text | ✓ trains | ✓ trains |
| 8 | VLA | smolvla_base | Vision-language-action | ✕ compile error | ✓ trains |
| 9 | VLM | SmolVLM-256M-Instruct | Image-text-to-text | ✕ crash | ✓ trains |
| 10 | MoE | switch-base-8 | Text-to-text (MoE) | ✕ hangs | ✓ trains |
Benchmarks
We maintain and contribute to open evaluation suites covering language understanding, reasoning, multimodal perception, and alignment.
Evaluation suite for [describe what it measures]. Includes [N] tasks across [domains].
Evaluation suite for [describe what it measures]. Includes [N] tasks across [domains].
Evaluation suite for [describe what it measures]. Includes [N] tasks across [domains].