@@ -20,7 +20,7 @@ Cortex has a Local Installer that packages all required dependencies, so that n
- [Mac (Universal)](https://app.cortexcpp.com/download/latest/mac-universal-local)
- [Linux](https://app.cortexcpp.com/download/latest/linux-amd64-local)

-## Start Cortex.cpp Processes and API Server
+## Start Cortex.cpp API Server
This command starts the Cortex.cpp API server at `localhost:39281`.
<Tabs>
  <TabItem value="MacOs/Linux" label="MacOs/Linux">
@@ -35,17 +35,38 @@ This command starts the Cortex.cpp API server at `localhost:39281`.
  </TabItem>
</Tabs>

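+One quick way to confirm the server is up is to query it over HTTP. A minimal sketch, assuming the default port `39281` and a `/healthz` health-check route (verify the route against your build):
+```sh
+# Assumed health-check route; expect an HTTP 200 response once the server is ready
+curl http://localhost:39281/healthz
+```
+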
+## Pull a Model & Select Quantization
+This command downloads a model from one of these model hubs:
+- [Cortex Built-in Models](https://cortex.so/models)
+- [Hugging Face](https://huggingface.co) (GGUF): `cortex pull <author/ModelRepo>`
+
+It lists the available quantizations, recommends a default, and downloads the one you select.
+<Tabs>
+  <TabItem value="MacOs/Linux" label="MacOs/Linux">
+  ```sh
+  cortex pull llama3.2
+  cortex pull bartowski/Meta-Llama-3.1-8B-Instruct-GGUF
+  ```
+  </TabItem>
+  <TabItem value="Windows" label="Windows">
+  ```sh
+  cortex.exe pull llama3.2
+  cortex.exe pull bartowski/Meta-Llama-3.1-8B-Instruct-GGUF
+  ```
+  </TabItem>
+</Tabs>
+
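+To verify a pull, you can list what is in the local model cache. A minimal sketch, assuming the `cortex models list` subcommand:
+```sh
+# Assumed subcommand; lists locally downloaded models and their quantizations
+cortex models list
+```
+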
## Run a Model
This command downloads the default `gguf` variant of the model from the [Cortex Hub](https://huggingface.co/cortexso), starts the model, and opens an interactive chat session with it.
<Tabs>
  <TabItem value="MacOs/Linux" label="MacOs/Linux">
  ```sh
-  cortex run mistral
+  cortex run llama3.2
  ```
  </TabItem>
  <TabItem value="Windows" label="Windows">
  ```sh
-  cortex.exe run mistral
+  cortex.exe run llama3.2
  ```
  </TabItem>
</Tabs>
@@ -78,33 +99,49 @@ curl http://localhost:39281/v1/chat/completions \
  "top_p": 1
}'
```
+Refer to our [API documentation](https://cortex.so/api-reference) for more details.
+
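+Because the endpoints follow the OpenAI convention, a streaming variant of the request above should only require `"stream": true`. A minimal sketch, assuming the pulled model is running and is registered under the plain ID `llama3.2` (your build may use a tagged ID; check the running models first):
+```sh
+# Stream the completion as server-sent events (assumes OpenAI-compatible streaming)
+curl http://localhost:39281/v1/chat/completions \
+  -H "Content-Type: application/json" \
+  -d '{
+    "model": "llama3.2",
+    "messages": [{"role": "user", "content": "Hello!"}],
+    "stream": true
+  }'
+```
+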
+## Show the System State
+This command displays the running model and the hardware system status (RAM, engine, VRAM, uptime).
+<Tabs>
+  <TabItem value="MacOs/Linux" label="MacOs/Linux">
+  ```sh
+  cortex ps
+  ```
+  </TabItem>
+  <TabItem value="Windows" label="Windows">
+  ```sh
+  cortex.exe ps
+  ```
+  </TabItem>
+</Tabs>

## Stop a Model
This command stops the running model.
<Tabs>
  <TabItem value="MacOs/Linux" label="MacOs/Linux">
  ```sh
-  cortex models stop mistral
+  cortex models stop llama3.2
  ```
  </TabItem>
  <TabItem value="Windows" label="Windows">
  ```sh
-  cortex.exe models stop mistral
+  cortex.exe models stop llama3.2
  ```
  </TabItem>
</Tabs>

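+The same action should also be available over HTTP. A hedged sketch, assuming a `/v1/models/stop` route that accepts the model ID (verify the exact route and payload in the API reference):
+```sh
+# Assumed route and payload; see https://cortex.so/api-reference
+curl http://localhost:39281/v1/models/stop \
+  -H "Content-Type: application/json" \
+  -d '{"model": "llama3.2"}'
+```
+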
-## Show the System State
-This command displays the running model and the hardware system status.
+## Stop Cortex.cpp API Server
+This command stops the Cortex.cpp API server running at `localhost:39281`.
<Tabs>
  <TabItem value="MacOs/Linux" label="MacOs/Linux">
  ```sh
-  cortex ps
+  cortex stop
  ```
  </TabItem>
  <TabItem value="Windows" label="Windows">
  ```sh
-  cortex.exe ps
+  cortex.exe stop
  ```
  </TabItem>
</Tabs>
@@ -137,4 +174,3 @@ Now that Cortex.cpp is set up, here are the next steps to explore:
1. Adjust the folder path and configuration using the [`.cortexrc`](/docs/basic-usage/cortexrc) file.
2. Explore the Cortex.cpp [data folder](/docs/data-folder) to understand how it stores data.
3. Learn about the structure of the [`model.yaml`](/docs/model-yaml) file in Cortex.cpp.
-4. Integrate Cortex.cpp [libraries](/docs/category/libraries) seamlessly into your Python or JavaScript applications.