Welcome to SkyPilot!
=========================
.. figure:: ./images/skypilot-wide-light-1k.png
:width: 60%
:align: center
:alt: SkyPilot
:class: no-scaled-link
.. raw:: html
Star
Watch
Fork
Run LLMs and AI on Any Cloud
SkyPilot is a framework for running LLMs, AI, and batch jobs on any cloud, offering maximum cost savings, highest GPU availability, and managed execution.
SkyPilot **abstracts away cloud infra burdens**:
- Launch jobs & clusters on any cloud
- Easy scale-out: queue and run many jobs, automatically managed
- Easy access to object stores (S3, GCS, R2)
SkyPilot **maximizes GPU availability for your jobs**:
* Provision in all zones/regions/clouds you have access to (`the Sky `_), with automatic failover
SkyPilot **cuts your cloud costs**:
* `Managed Spot `_: 3-6x cost savings using spot VMs, with auto-recovery from preemptions
* Optimizer: 2x cost savings by auto-picking the cheapest VM/zone/region/cloud
* `Autostop `_: hands-free cleanup of idle clusters
SkyPilot supports your existing GPU, TPU, and CPU workloads, with no code changes.
Current supported providers (AWS, Azure, GCP, Lambda Cloud, IBM, Samsung, OCI, Cloudflare, any Kubernetes cluster):
.. raw:: html
More Information
--------------------------
Tutorials: `SkyPilot Tutorials `_
Runnable examples:
* **LLMs on SkyPilot**
* `Mistral 7B `_ (from official Mistral team)
* `vLLM: Serving LLM 24x Faster On the Cloud `_ (from official vLLM team)
* `Vicuna chatbots: Training & Serving `_ (from official Vicuna team)
* `Train your own Vicuna on Llama-2 `_
* `Self-Hosted Llama-2 Chatbot `_
* `QLoRA `_
* `LLaMA-LoRA-Tuner `_
* `Tabby: Self-hosted AI coding assistant `_
* `LocalGPT `_
* `Falcon `_
* Add yours here & see more in `llm/ `_!
* Framework examples: `PyTorch DDP `_, `DeepSpeed `_, `JAX/Flax on TPU `_, `Stable Diffusion `_, `Detectron2 `_, `Distributed `_ `TensorFlow `_, `programmatic grid search `_, `Docker `_, and `many more `_.
Follow updates:
* `Twitter `_
* `Slack `_
* `SkyPilot Blog `_ (`Introductory blog post `_)
Read the research:
* `SkyPilot paper `_ and `talk `_ (NSDI 2023)
* `Sky Computing whitepaper `_
* `Sky Computing vision paper `_ (HotOS 2021)
Documentation
--------------------------
.. toctree::
:maxdepth: 1
:caption: Getting Started
getting-started/installation
getting-started/quickstart
getting-started/tutorial
examples/gpu-jupyter
.. toctree::
:maxdepth: 1
:caption: Running Jobs
reference/job-queue
reference/tpu
examples/auto-failover
running-jobs/index
.. toctree::
:maxdepth: 1
:caption: Cutting Cloud Costs
examples/spot-jobs
reference/auto-stop
reference/benchmark/index
.. toctree::
:maxdepth: 1
:caption: Using Data
examples/syncing-code-artifacts
reference/storage
.. toctree::
:maxdepth: 1
:caption: User Guides
examples/docker-containers
examples/ports
examples/iterative-dev-project
reference/interactive-nodes
reference/faq
reference/logging
reference/kubernetes/index
.. toctree::
:maxdepth: 1
:caption: Cloud Admin and Usage
cloud-setup/cloud-permissions/index
cloud-setup/cloud-auth
cloud-setup/quota
.. toctree::
:maxdepth: 1
:caption: References
reference/yaml-spec
reference/cli
reference/api
reference/config