Achyuthan-S
Achyuthan Sivasankar

@achyuthan-s · NYC

NYU MS CS · Fall '27 ML Research
Download CV

MS CS · NYU · Graduating Fall 2027

Achyuthan |

Achyuthan Sivasankar

Achyuthan Sivasankar

Adaptive Computation · MoE Systems · Efficient Architectures

## About

I build and study adaptive routing in sparse neural architectures — from function-basis routing in KAN layers to expert-collapse dynamics in large MoE language models.

## Research highlights
+1,722 FSD lead steps before grokking (9 configs)
+6.8% CIFAR-100 vs MLP (KAN-Multi)
99.67% SWELL-KW stress detection (NUS)
MoE-Bench Open-source · Apache 2.0 arXiv Grokking paper · 2606.12966
JEPA AD-LiST-JEPA · Waymo LiDAR (NYU lab)
## Research interests
  • PyTorch · JAX · HuggingFace
  • Mechanistic interpretability · Grokking
  • MoE architectures · KAN layers
  • LoRA / PEFT · RAG · LLM fine-tuning
  • Self-supervised learning · Computer Vision
  • Python · Docker · AWS · FastAPI
## Education & affiliations
  1. MS Computer Science

    New York University

    Graduating Fall 2027 · NYC

  2. Research Assistant — Prof. Anna Choromanska's Lab

    NYU · AD-LiST-JEPA · Waymo Open Dataset

    May 2026 – Present

  3. Research Intern — Prof. Sunil Chandran

    IISc Bangalore · GNN routing · EEG / BCI

    Jun – Aug 2024

  4. AI Research Intern

    National University of Singapore

    Dec 2023 – Feb 2024

## Currently

MS CS student at NYU · Graduating Fall 2027 · Research Assistant in Prof. Anna Choromanska's lab · Targeting PhD programs in core ML

GitHub · Contributions

Hover or tap for details

ML / Research

PyTorch

PyTorch

JAX

JAX

HF

HuggingFace

MoE

MoE architectures

KAN

KAN layers

LoRA

LoRA / PEFT

RAG

RAG · LLM fine-tuning

SSL

Self-supervised learning

Languages

Python

Python (expert)

C/C++

C / C++

SQL

SQL

MLOps & Infra

Docker

Docker

FastAPI

FastAPI · Flask

AWS

AWS · S3 · EC2

Git

Git · Linux

Frameworks & Tools

TF

TensorFlow / Keras

Sklearn

Scikit-learn

NumPy

NumPy · Pandas

W&B

Weights & Biases

Pinned Repositories & Research

moe-bench Public

Open routing entropy benchmark for sparse MoE LLMs. Pip-installable, Mac MPS validated, Apache 2.0.

MoE Benchmark Apache 2.0
Python Shipped
rag-acga-knowledge-base-memory-system Public

Adaptive Corrective Graph-Augmented RAG with 4-layer memory. 85ms cold latency, 15.5× cached speedup. Production-ready, zero-cost deployment.

RAG Graph Production
Python 15.5× cache
KAN-Multi No public repo

Multi-basis adaptive routing layer with 6 function families and statistics-driven router. +6.8% over MLP on CIFAR-100. Manuscript in prep — no public repo yet.

KAN Routing CIFAR-100
PyTorch +6.8% vs MLP
AutoMoE No public repo

Evolutionary REINFORCE meta-learner discovering optimal MoE topologies. 2× parameter efficiency on Meta-World ML10. Research project — no public repo yet.

MoE Meta-Learning ML10
Python 2× efficiency

Let's talk
research.

I'm open to research collaborations, PhD inquiries, and ML engineering roles. If you're working on adaptive computation, efficient architectures, or autonomous perception — reach out.

Currently based in New York City · MS CS at NYU · Graduating Fall 2027

Opens your email client with your message pre-filled.