
Commit e17c57b

Merge pull request #49 from slashml/update-docs
model deployment
2 parents 3dff527 + 262c16d commit e17c57b

31 files changed: +853 −1002 lines changed
Lines changed: 9 additions & 0 deletions
@@ -0,0 +1,9 @@
====================
Model Deployment API
====================


.. mdinclude:: snippets/model_deployment_1.md


.. mdinclude:: snippets/model_deployment_2.md
.. mdinclude:: snippets/model_deployment_3.md
.. mdinclude:: snippets/model_deployment_4.md
Lines changed: 2 additions & 2 deletions
@@ -1,9 +1,9 @@
### Check status of job

- Now that the audio file has been submitted for transcription, we can make requests to GET the status of the transcription, and eventually the result of the transcription.
+ The request API is the same for all job submissions: we can make GET requests to check the status of a job and eventually retrieve its result, e.g. the transcription or the speechified audio.

```bash
- GET https://api.slashml.com/speech-to-text/v1/status/YOUR-JOB-ID/
+ GET https://api.slashml.com/speech-to-text/v1/jobs/YOUR-JOB-ID/
```

#### Request
Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
This service allows you to fetch the status of a deployed model and call predictions on the exposed API.

## Time to Integrate

Less than 5 minutes
Lines changed: 87 additions & 0 deletions
@@ -0,0 +1,87 @@
## Instructions

1. (Optional) For deployment we recommend using the Python SDK. However, you can also push your model using the API. To do this, first install `truss` to create a truss folder locally. Then compress that truss folder into a `.tar.gz` file and send a `POST` request to `https://api.slashml.com/model-deployment/v1/models` with the compressed file as the body of the request. Save the `id` from the response object.
2. Check the status of the model deployment by sending a `GET` request to `https://api.slashml.com/model-deployment/v1/models/YOUR-MODEL-ID/status`.
3. You can make predictions on the deployed model by sending a `POST` request to `https://api.slashml.com/model-deployment/v1/models/YOUR-MODEL-ID/predict`. The body should contain a JSON object with `model_input`, which is the input prompt to the model.


## Code Blocks
### Submit model for deployment

Install truss

```
pip install truss
```

You can then create a truss folder by running the following code from within Python:

```python
# you might have to install transformers and torch
import truss
from transformers import pipeline

def train_model():
    # bring in the model from huggingface
    return pipeline('fill-mask', model='bert-base-uncased')

my_model = train_model()

# save the model as a truss folder named 'my_model'
truss.create(my_model, 'my_model')
```
Then convert the folder into a `.tar.gz` file:

```bash
tar -czvf my_model.tar.gz my_model
```
#### Request

Then send a `POST` request to `https://api.slashml.com/model-deployment/v1/models` with the compressed file as the body of the request. Save the `id` from the response object.

```python
import requests

url = "https://api.slashml.com/model-deployment/v1/models/"

payload = {'model_name': 'test-dep-model'}

files = [
    ('model_file', ('my_model.tar.gz', open('path/to/my_model.tar.gz', 'rb'), 'application/octet-stream'))
]

headers = {
    'Authorization': 'Token YOUR_TOKEN'
}

response = requests.request("POST", url, headers=headers, data=payload, files=files)

print(response.json())
```
#### Response (200)

```bash
{
    "id": "a5822206-9680-444c-87ec-4b66a7bcfc26",
    "created": "2023-06-13T06:38:55.311751Z",
    "status": "IN_PROGRESS",
    "name": "test-dep-model"
}
```
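The returned `id` is what the status and predict endpoints below expect. A minimal sketch of keeping it around, assuming the `response` object from the request code above:

```python
# a minimal sketch: keep the model id from the response above for later calls
model_id = response.json()["id"]
print(model_id)  # e.g. "a5822206-9680-444c-87ec-4b66a7bcfc26"
```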
#### Response (400)

```python
{
    "error" : {
        "message" : "something bad happened"
    }
}
```
Lines changed: 48 additions & 0 deletions
@@ -0,0 +1,48 @@
### Check status of model

```bash
GET https://api.slashml.com/model-deployment/v1/models/YOUR-MODEL-ID/status
```

#### Request

```python
import requests

url = 'https://api.slashml.com/model-deployment/v1/models/YOUR-MODEL-ID/status'

headers = {
    'Authorization': 'Token <YOUR_API_KEY>'
}

response = requests.get(url, headers=headers)
print(response.json())
```
#### Response (200) - MODEL_READY

```bash
{
    # keep track of the id for later
    "id": "ozfv3zim7-9725-4b54-9b71-f527bc21e5ab",
    "created": "2023-06-13T06:38:55.311751Z",
    "name": "test-dep-model",
    "status": "MODEL_READY"
}
```
#### Response (400) - Error

```bash
{
    "error" : {
        "message" : "something bad happened"
    }
}
```

> Note:
> The status will go from 'QUEUED' to 'BUILDING_MODEL' to 'DEPLOYING_MODEL' to 'MODEL_READY'. If there's an error processing your input, the status will go to 'FAILURE' and there will be an 'ERROR' key in the response JSON which will contain more information.
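Deployment can take a while to move through these states. A minimal polling sketch, assuming only the status endpoint and status values documented above (not an official client):

```python
import time
import requests

url = 'https://api.slashml.com/model-deployment/v1/models/YOUR-MODEL-ID/status'
headers = {'Authorization': 'Token <YOUR_API_KEY>'}

# poll the status endpoint until the deployment either succeeds or fails
while True:
    status = requests.get(url, headers=headers).json()
    if status['status'] == 'MODEL_READY':
        print('model is ready:', status['id'])
        break
    if status['status'] == 'FAILURE':
        # per the note above, a FAILURE response should carry an 'ERROR' key
        print('deployment failed:', status.get('ERROR'))
        break
    # still QUEUED, BUILDING_MODEL, or DEPLOYING_MODEL
    time.sleep(30)
```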
Lines changed: 85 additions & 0 deletions
@@ -0,0 +1,85 @@
### Submit a prediction to the model


#### Request

Send a `POST` request to `https://api.slashml.com/model-deployment/v1/models/YOUR-MODEL-ID/predict` with the model input as the body of the request.

```python
import requests
import json

url = "https://api.slashml.com/model-deployment/v1/models/YOUR-MODEL-ID/predict"

payload = json.dumps({
    "model_input": [
        "steve jobs is the [MASK] of apple"
    ]
})

headers = {
    'Authorization': 'Token <YOUR_API_KEY>',
    'Content-Type': 'application/json'
}

response = requests.request("POST", url, headers=headers, data=payload)

print(response.text)
```
#### Response (200)

```bash
{
    "id": "a5822206-9680-444c-87ec-4b66a7bcfc26",
    "model_input": [
        "steve jobs is the [MASK] of apple"
    ],
    "model_response": {
        "predictions": [
            {
                "score": 0.516463041305542,
                "sequence": "steve jobs is the founder of apple",
                "token": 3910,
                "token_str": "founder"
            },
            {
                "score": 0.3604991137981415,
                "sequence": "steve jobs is the ceo of apple",
                "token": 5766,
                "token_str": "ceo"
            },
            {
                "score": 0.04929964989423752,
                "sequence": "steve jobs is the president of apple",
                "token": 2343,
                "token_str": "president"
            },
            {
                "score": 0.021112028509378433,
                "sequence": "steve jobs is the creator of apple",
                "token": 8543,
                "token_str": "creator"
            },
            {
                "score": 0.008550147525966167,
                "sequence": "steve jobs is the father of apple",
                "token": 2269,
                "token_str": "father"
            }
        ]
    }
}
```
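A short sketch of how one might consume this response, with field names taken from the example above and assuming the `response` object from the request code:

```python
# a minimal sketch: pick the highest-scoring prediction out of the response above
result = response.json()
predictions = result["model_response"]["predictions"]
best = max(predictions, key=lambda p: p["score"])
print(best["sequence"])  # e.g. "steve jobs is the founder of apple"
print(best["score"])     # e.g. 0.5164...
```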
#### Response (400)

```python
{
    "error": "some error occured when requesting job status",
    "full_message": [
        "{'error': ErrorDetail(string='model not ready', code='permission_denied'), 'reasons': [ErrorDetail(string='model not ready', code='permission_denied'), ErrorDetail(string='model is still being built or deployed', code='permission_denied')]}"
    ]
}
```
File renamed without changes.
File renamed without changes.
Lines changed: 53 additions & 0 deletions
@@ -0,0 +1,53 @@
### Convert audio to text

The body of the request should contain a field `uploaded_audio_url`, whose value is the URL of the uploaded audio file, and `service_provider`, which is the name of the service provider you want to use.

```bash
POST https://api.slashml.com/speech-to-text/v1/jobs/
```

#### Request

```python
import requests

url = 'https://api.slashml.com/speech-to-text/v1/jobs/'

payload = {
    "uploaded_audio_url": "https://cdn.slashml.com/upload/ccbbbfaf-f319-4455-9556-272d48faaf7f",
    "service_provider": 'assembly'
}
headers = {
    "Authorization": "Token <YOUR_API_KEY>",
}

response = requests.post(url, headers=headers, data=payload)
print(response.text)
```
#### Response (200)

```bash
{
    # keep track of the id for later
    "id": "ozfv3zim7-9725-4b54-9b71-f527bc21e5ab",
    # note that the status is now "in progress"
    "status": "IN_PROGRESS",
    "audio_duration": null,
    "audio_url": "https://cdn.slashml.com/upload/ccbbbfaf-f319-4455-9556-272d48faaf7f",
    "text": null
}
```
#### Response (400)

```bash
{
    "error" : {
        "message" : "something bad happened"
    }
}
```

> Note:
> The 'id' will be used to fetch the status of the job from the status endpoint.
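A minimal follow-up sketch, assuming the jobs status endpoint shown in the status snippet (`GET https://api.slashml.com/speech-to-text/v1/jobs/YOUR-JOB-ID/`) and the `response` object from the request above:

```python
# a minimal sketch: use the returned id to query the job status endpoint
job_id = response.json()["id"]

status_url = f"https://api.slashml.com/speech-to-text/v1/jobs/{job_id}/"
status = requests.get(status_url, headers={"Authorization": "Token <YOUR_API_KEY>"}).json()

# once processing finishes, the transcription should appear in the 'text' field
print(status["status"], status.get("text"))
```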
File renamed without changes.
