I am a Performance Engineer. I have observed that many organizations run heavy multi-modal pipelines.
I specialize in squeezing PyTorch models into custom C++/CUDA kernels.
I have a few fractional slots open to help startups optimize their VRAM footprints or slash their GPU cloud bills.
anany.mishra.dev@gmail.com
Projects and Work
Random Thoughts
Professional Me