[Enhancement] Generated remote model interfaces should include detailed model outputs #2945

ohltyler · 2024-09-13T17:29:28Z

The current generated model output interfaces just include the boilerplate. The actual useful model outputs will be under dataAsMap, but this is always defined as a generic object with no sub-properties.

"inference_results": {
    "type": "array",
    "items": {
        "type": "object",
        "properties": {
            "output": {
                "type": "array",
                "items": {
                    "properties": {
                        "name": {
                            "type": "string",
                            "description": "This is a test description field"
                        },
                        "dataAsMap": {
                            "type": "object",
                            "description": "This is a test description field"
                        }
                    }
                },
                "description": "This is a test description field"
            },
            "status_code": {
                "type": "integer",
                "description": "This is a test description field"
            }
        }
    },
    "description": "This is a test description field"
}

The dataAsMap field should be further defined to include the actual model outputs

The text was updated successfully, but these errors were encountered:

b4sjoo · 2024-09-16T18:15:46Z

@ohltyler Can you check https://github.com/opensearch-project/ml-commons/pull/2689/files to see if you are satisfied with the current granularity for our output interface

ohltyler · 2024-09-16T18:48:23Z

Got it, thanks for the reference! I've tested with some other blueprints, and it seems to work as expected. I think the confusion I was having was around how much flexibility can be made to the connectors, before it stops producing the detailed interfaces. The documentation states:

The predefined model interface is generated based on the connector blueprint and the model's metadata, so you must strictly follow the blueprint when creating the connector in order to avoid errors.

For example, I had some blueprints that had a supported model defined in-line, but not included as one of the parameters. Or some models that weren't part of the list I'm seeing defined here.

From that code, it looks like a blueprint will be generated automatically if & only if:

There is a service_name and model defined under parameters
The values of service_name / model are included in the switch statement
All other parts of the blueprint (the protocol/credentials/actions/request body etc.) can be anything

If so, can we update the documentation to reflect this a bit more, on what flexibility and what restrictions are put in place? It would be helpful to know how users can still have custom request bodies, for example, but still leverage automated model interfaces.

ohltyler added the enhancement New feature or request label Sep 13, 2024

github-actions bot added the untriaged label Sep 13, 2024

ylwu-amzn assigned b4sjoo Sep 16, 2024

b4sjoo removed the untriaged label Sep 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Enhancement] Generated remote model interfaces should include detailed model outputs #2945

[Enhancement] Generated remote model interfaces should include detailed model outputs #2945

ohltyler commented Sep 13, 2024

b4sjoo commented Sep 16, 2024

ohltyler commented Sep 16, 2024

[Enhancement] Generated remote model interfaces should include detailed model outputs #2945

[Enhancement] Generated remote model interfaces should include detailed model outputs #2945

Comments

ohltyler commented Sep 13, 2024

b4sjoo commented Sep 16, 2024

ohltyler commented Sep 16, 2024