Add Feature Extraction Support for API Classifiers #77

mohamedelabbas1996 · 2025-04-14T18:36:13Z

Description

This PR adds support for returning model feature vectors (embeddings) alongside classification results in the Data Companion API.
The classification pipeline now supports returning a vector embedding per classification, derived from the classification model backbone.

The changes are fully backward-compatible for models that do not implement custom get_features(), as they will fallback to returningNone from the base class.

Related Issues

#752

Screenshots

Detection features clustering visualization using K-means + PCA

sentry · 2025-04-14T18:36:25Z

🔍 Existing Issues For Review

Your pull request is modifying functions with the following pre-existing issues:

📄 File: trapdata/api/models/classification.py

Function	Unhandled Issue
`save_results`	ValidationError: 15 validation errors for ClassificationResponse ... `Event Count:` 2
`save_results`	ValidationError: 10 validation errors for ClassificationResponse ... `Event Count:` 2
`save_results`	AttributeError: 'NoneType' object has no attribute 'tolist' ... `Event Count:` 1
`save_results`	ValueError: not enough values to unpack (expected 3, got 2) ... `Event Count:` 1

_{Did you find this useful? React with a 👍 or 👎}

mihow · 2025-04-26T00:25:38Z

pyproject.toml

 ]
-
+plotly = "^5.21.0"
+scikit-learn = "^1.3.0"


I think we should make these optional dependencies and just use numpy in the tests. unless we need to use them in the core app.

[tool.poetry.extras] dev = ["plotly", "scikit-learn"]

trapdata/api/tests/test_features_extraction.py

mihow · 2025-04-26T00:29:46Z

trapdata/ml/models/classification.py

        model.eval()
        return model

+    def get_features(self, batch_input: torch.Tensor) -> torch.Tensor:


Nice work on this method of extracting features! It seems more flexible than our current feature extractor. Perhaps we should add a comment in both feature extractors that the other one exists. And eventually update the old one to use this code.

mohamedelabbas1996 added 4 commits April 13, 2025 21:00

feat: Added features field to the classification response

368edc2

feat: add support for returning features in APIMothClassifier response

4484f2e

added fallback get_features method to the InferenceBaseClass

3cc31ad

feat: implemented get_features for Resnet50TimmClassifier class

8071168

mohamedelabbas1996 added 4 commits April 14, 2025 14:43

chore: moved features dim to constants

52f0f62

Default to None if get_features is not implemented

b4c3af7

Added features extraction tests

ae62dd5

Removed prints

88c8220

mohamedelabbas1996 marked this pull request as ready for review April 22, 2025 15:38

mohamedelabbas1996 added 3 commits April 23, 2025 10:15

Added clustering using K-Means and visualization

fa7dee8

Added plotly dependency

cce38f3

Added sklearn dependency

902331b

mihow reviewed Apr 26, 2025

View reviewed changes

trapdata/api/tests/test_features_extraction.py Show resolved Hide resolved

mihow reviewed Apr 26, 2025

View reviewed changes

chore: make plotly optional, fix type warnings

9306bd0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Feature Extraction Support for API Classifiers #77

Add Feature Extraction Support for API Classifiers #77

Uh oh!

mohamedelabbas1996 commented Apr 14, 2025 •

edited

Loading

Uh oh!

sentry bot commented Apr 14, 2025

Uh oh!

mihow Apr 26, 2025

Uh oh!

Uh oh!

mihow Apr 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add Feature Extraction Support for API Classifiers #77

Are you sure you want to change the base?

Add Feature Extraction Support for API Classifiers #77

Uh oh!

Conversation

mohamedelabbas1996 commented Apr 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issues

Screenshots

Uh oh!

sentry bot commented Apr 14, 2025

🔍 Existing Issues For Review

Uh oh!

mihow Apr 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mihow Apr 26, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mohamedelabbas1996 commented Apr 14, 2025 •

edited

Loading