1182 ml classification queue #1219
base: dev
Conversation
except requests.exceptions.RequestException as e:
    return {"status": JobStatusEnum.FAILED, "message": f"API request failed: {str(e)}"}

def load_model(self, model_identifier: str) -> bool:
Rename this to `request_load_model`.
self.model_identifier = model_identifier

def _unload_all_other_models(self) -> bool:
    """Unload all models except the one managed by this instance"""
Need to implement:
- `_request_model_unload`: sends the unload request.
- `unload_model`: calls `_request_model_unload`, then sends status-check requests until the unload is confirmed, with error handling if the maximum number of retries is reached.
- `unload_all_models`: runs `unload_model` for every loaded model.

Then we can implement a similar pattern on the loading side as well. Lastly, let's merge the model manager into the API client, since there is no logical separation between them.
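The request/poll/confirm pattern described above could be sketched roughly as follows. Note this is an illustrative sketch only: the endpoint paths, response shapes, and the helper names `_post`, `_get`, and the in-memory `_loaded` set are assumptions standing in for the real inference API, not the project's actual code.

```python
import time

class InferenceAPIClient:
    """Sketch of the merged client (model manager folded into the API client)."""

    MAX_RETRIES = 5
    POLL_INTERVAL = 0.0  # seconds between status checks (0 here so the sketch runs instantly)

    def __init__(self):
        # Stand-in for real server state, so the sketch is runnable.
        self._loaded = {"model-a", "model-b"}

    def _post(self, path: str) -> None:
        # Pretend the server accepts the unload request asynchronously.
        model = path.split("/")[2]
        self._loaded.discard(model)

    def _get(self, path: str) -> dict:
        model = path.split("/")[2]
        return {"state": "loaded" if model in self._loaded else "unloaded"}

    def _request_model_unload(self, model_identifier: str) -> None:
        """Send the unload request without waiting for completion."""
        self._post(f"/models/{model_identifier}/unload")

    def unload_model(self, model_identifier: str) -> bool:
        """Request unload, then poll status until confirmed or retries run out."""
        self._request_model_unload(model_identifier)
        for _ in range(self.MAX_RETRIES):
            if self._get(f"/models/{model_identifier}/status")["state"] == "unloaded":
                return True
            time.sleep(self.POLL_INTERVAL)
        raise RuntimeError(f"unload of {model_identifier} not confirmed after {self.MAX_RETRIES} checks")

    def unload_all_models(self) -> None:
        """Run unload_model for every loaded model."""
        for model_identifier in list(self._loaded):
            self.unload_model(model_identifier)
```

The loading side would mirror this: a fire-and-forget `_request_model_load` plus a polling `load_model` with the same retry cap.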
"""Process this external job and update status/results"""
try:
    api_client = InferenceAPIClient()
    model_version = ModelVersion.objects.get(classification_type=self.inference_job.classification_type)
Why do we need the model version here?
Check the API documentation to see whether a model identifier is needed in conjunction with the `job_id`.
inference/models/inference.py
Outdated
if new_status == ExternalJobStatus.COMPLETED:
    self.store_results(response.get("results"))
elif new_status in [ExternalJobStatus.FAILED, ExternalJobStatus.CANCELLED]:
    self.set_error(response.get("message", ""))
Add a default note indicating that the job failed or was cancelled when no message is present in the response.
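One way to do that is a status-keyed fallback. This is a sketch assuming the `response` dict and the `ExternalJobStatus` enum from the snippet above (stubbed here so the example runs); the fallback wording is an assumption for the maintainers to adjust.

```python
from enum import Enum

class ExternalJobStatus(Enum):  # stand-in for the project's enum
    COMPLETED = "completed"
    FAILED = "failed"
    CANCELLED = "cancelled"

# Fallback texts are assumptions; the real wording is up to the maintainers.
DEFAULT_MESSAGES = {
    ExternalJobStatus.FAILED: "Job failed; no message returned by the inference API.",
    ExternalJobStatus.CANCELLED: "Job was cancelled; no message returned by the inference API.",
}

def error_message(new_status: ExternalJobStatus, response: dict) -> str:
    """Prefer the API's message, falling back to a status-specific default."""
    return response.get("message") or DEFAULT_MESSAGES[new_status]
```

In the snippet above, `self.set_error(response.get("message", ""))` would become `self.set_error(error_message(new_status, response))`, so an empty or missing message still records why the job stopped.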
inference/utils/batch.py
Outdated
"""Prepare single URL data for API"""
return {
    "url_id": url.id,
    "text": url.scraped_text[: self.config.max_text_length],
Need to refactor this. It should take in a maximum text length only, not a URL count, and then grab as many URLs as will fit within that maximum. If a single URL exceeds the maximum text length, that URL can be sent by itself in a batch of one.
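The suggested refactor amounts to greedy bin-packing by text length. Here is a runnable sketch of that idea; `build_batches` and the `Url` namedtuple are hypothetical stand-ins for the project's batch builder and URL model, not its actual code.

```python
from collections import namedtuple

# Hypothetical stand-in for the Django URL model used in batch.py.
Url = namedtuple("Url", ["id", "scraped_text"])

def build_batches(urls, max_text_length: int):
    """Greedily pack URLs into batches whose combined text length stays
    within max_text_length; an oversized URL is sent alone, truncated."""
    batches, current, current_len = [], [], 0
    for url in urls:
        text = url.scraped_text
        if len(text) > max_text_length:
            # A single URL exceeding the limit goes by itself in a batch of 1.
            if current:
                batches.append(current)
                current, current_len = [], 0
            batches.append([{"url_id": url.id, "text": text[:max_text_length]}])
            continue
        if current_len + len(text) > max_text_length and current:
            # This URL would overflow the current batch; start a new one.
            batches.append(current)
            current, current_len = [], 0
        current.append({"url_id": url.id, "text": text})
        current_len += len(text)
    if current:
        batches.append(current)
    return batches
```

For example, with `max_text_length=10`, URLs with texts of length 4 and 3 share one batch, while a 20-character URL lands alone in a batch of one, truncated to 10 characters.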
container_name: sde_indexing_helper_local_celerybeat
depends_on:
  - redis
  - postgres
Should we also add this config to the production.yml file?