About Modal
What is Modal?
Modal delivers a purpose-built platform for AI and data workloads, enabling teams to deploy inference, training, and batch processing without wrestling with infrastructure. With features like sub-second container startup, instant autoscaling to thousands of GPUs or containers, and usage-based pricing, Modal abstracts away DevOps so developers can focus on building models and applications. It supports inference of large language models, image/video/audio generation, fine-tuning, sandboxes for untrusted code, job scheduling and large-scale batch workloads. The platform emphasises performance, developer experience and enterprise-grade governance including SOC2/HIPAA compliance and data-residency controls.
How to Use Modal
Key Features of Modal
Containers launch in seconds so feedback loops remain tight and latency stays low.
Scale from zero to thousands of nodes or GPUs automatically to handle high-volume or burst workloads.
Deploy functions and containers with minimal configuration files—defining environment and hardware requirements inline.
Run production inference of language, vision, and audio models, fine-tune models, execute large-scale batch workflows or sandbox untrusted code in one platform.
Access thousands of GPUs across clouds, shrink to zero when idle, and pay only for what you use.
Provides SOC2 & HIPAA compliance, team access controls, isolation, and data-residency controls suitable for enterprise deployment.






