You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/content/advanced/microservices.md
+40-28Lines changed: 40 additions & 28 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -61,6 +61,17 @@ You will need to build the {{< short_app_ref >}} container images and stage them
61
61
podman push <server_repository>:latest
62
62
```
63
63
64
+
### Namespace
65
+
66
+
Create a Kubernetes namespace to logically isolate the {{< short_app_ref >}} resources. For demonstration purposes, the `ai-optimizer` namespace will be created and used throughout this documentation.
67
+
68
+
```yaml
69
+
apiVersion: v1
70
+
kind: Namespace
71
+
metadata:
72
+
name: ai-optimizer
73
+
```
74
+
64
75
### Ingress
65
76
66
77
To access the {{< short_app_ref >}} GUI and API Server, you can either use a port-forward or an Ingress service. For demonstration purposes, the [OCI Native Ingress Controller](https://docs.oracle.com/en-us/iaas/Content/ContEng/Tasks/contengsettingupnativeingresscontroller.htm), which was enabled on the **OKE** cluster as part of the **IaC**, will be used to for public Ingress access.
@@ -81,30 +92,25 @@ These will be output as part of the **IaC** but can be removed from the code if
@@ -125,20 +131,13 @@ These will be output as part of the **IaC** but can be removed from the code if
125
131
The {{< short_app_ref >}} can be deployed using the [Helm](https://helm.sh/) chart provided with the source:
126
132
[{{< short_app_ref >}} Helm Chart](https://github.com/oracle-samples/ai-optimizer/tree/main/helm). A list of all values can be found in [values_summary.md](https://github.com/oracle-samples/ai-optimizer/tree/main/helm/values_summary.md).
127
133
128
-
If you deployed a GPU node pool as part of the **IaC**, you can deploy Ollama and enable a Large Language and Embedding Model out-of-the-box.
129
-
130
-
1. Create the `ai-optimizer` namespace:
131
-
132
-
```bash
133
-
kubectl create namespace ai-optimizer
134
-
```
134
+
If you deployed a GPU node pool as part of the **IaC**, [Ollama](https://ollama.com/) will be deployed automatically and a Large Language and Embedding Model will be available out-of-the-box.
@@ -172,7 +170,7 @@ If you deployed a GPU node pool as part of the **IaC**, you can deploy Ollama an
172
170
173
171
These will be output as part of the **IaC**.
174
172
175
-
{{< icon "star" >}} If using the **IaC** for **OCI**, it is not required to specify an ImagePullSecret as the cluster nodes are configured with the [Image Credential Provider for OKE](https://github.com/oracle-devrel/oke-credential-provider-for-ocir).
173
+
{{< icon "star" >}} If using the **IaC** for **OCI**, it is not required to specify an ImagePullSecret as the cluster nodes are configured with the [Image Credential Provider for OKE](https://github.com/oracle-devrel/oke-credential-provider-for-ocir). It may take up to 5 minutes for the policy allowing for the image pull to be recognized.
176
174
177
175
```yaml
178
176
global:
@@ -185,6 +183,7 @@ If you deployed a GPU node pool as part of the **IaC**, you can deploy Ollama an
185
183
image:
186
184
repository: <server_repository>
187
185
tag: "latest"
186
+
imagePullPolicy: Always
188
187
189
188
ingress:
190
189
enabled: true
@@ -198,20 +197,33 @@ If you deployed a GPU node pool as part of the **IaC**, you can deploy Ollama an
198
197
http:
199
198
type: "NodePort"
200
199
200
+
# -- Oracle Cloud Infrastructure Configuration
201
+
oci:
202
+
tenancy: "<tenancy_ocid>"
203
+
region: "<oci_region>"
204
+
201
205
# -- Oracle Autonomous Database Configuration
202
206
adb:
203
207
enabled: true
204
208
ocid: "<adb_ocid>"
205
209
mtls:
206
-
enabled: false
210
+
enabled: true
207
211
authN:
208
212
secretName: "db-authn"
213
+
usernameKey: "username"
214
+
passwordKey: "password"
215
+
serviceKey: "service"
216
+
217
+
models:
218
+
ollama:
219
+
enabled: false
209
220
210
221
client:
211
222
enabled: true
212
223
image:
213
224
repository: <client_repository>
214
225
tag: "latest"
226
+
imagePullPolicy: Always
215
227
216
228
ingress:
217
229
enabled: true
@@ -229,10 +241,10 @@ If you deployed a GPU node pool as part of the **IaC**, you can deploy Ollama an
229
241
disableTestbed: "false"
230
242
disableApi: "false"
231
243
disableTools: "false"
232
-
disableDbCfg: "true"
244
+
disableDbCfg: "false"
233
245
disableModelCfg: "false"
234
-
disableOciCfg: "true"
235
-
disableSettings: "true"
246
+
disableOciCfg: "false"
247
+
disableSettings: "false"
236
248
237
249
ollama:
238
250
enabled: true
@@ -250,8 +262,8 @@ If you deployed a GPU node pool as part of the **IaC**, you can deploy Ollama an
format("allow any-user to manage waf-family in compartment id %s where all {request.principal.type = 'workload', request.principal.namespace = 'native-ingress-controller-system', request.principal.service_account = 'oci-native-ingress-controller', request.principal.cluster_id = '%s'}", var.compartment_id, oci_containerengine_cluster.default_cluster.id),
28
38
format("allow any-user to read cluster-family in compartment id %s where all {request.principal.type = 'workload', request.principal.namespace = 'native-ingress-controller-system', request.principal.service_account = 'oci-native-ingress-controller', request.principal.cluster_id = '%s'}", var.compartment_id, oci_containerengine_cluster.default_cluster.id),
29
39
format("allow any-user to use tag-namespaces in compartment id %s where all {request.principal.type = 'workload', request.principal.namespace = 'native-ingress-controller-system', request.principal.service_account = 'oci-native-ingress-controller', request.principal.cluster_id = '%s'}", var.compartment_id, oci_containerengine_cluster.default_cluster.id),
30
-
format("allow dynamic-group %s to manage repos in compartment id %s", var.dynamic_group, var.compartment_id),
40
+
format("allow dynamic-group %s to use generative-ai-family in compartment id %s", oci_identity_dynamic_group.workers_dynamic_group.name, var.compartment_id),
41
+
format("allow dynamic-group %s to manage repos in compartment id %s", oci_identity_dynamic_group.workers_dynamic_group.name, var.compartment_id),
0 commit comments