Add support for Bedrock Converse API (Anthropic Messages API, Claude 3.5 Sonnet) #2851

austintlee · 2024-08-28T05:17:38Z

Description

Adds support for latest Message API

Related Issues

Resolves #2826

Check List

[ x] New functionality includes testing.
New functionality has been documented.
API changes companion pull request created.
[ x] Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

austintlee · 2024-08-28T13:46:05Z

I need a jpeg file and a pdf file for testing image and document contents and they are failing the utf-8 content checks.

I see that in the core, the ingest plugin has this:

 forbiddenPatterns {
   exclude '**/*.doc'
   exclude '**/*.docx'
   exclude '**/*.pdf'
   exclude '**/*.epub'
   exclude '**/*.vsdx'
 }

I can add that to plugin/build.gradle.

Zhangxunmt · 2024-08-29T18:16:41Z

@austintlee just a few comments left but LGTM overall.
Since this new API support will be used by the requester in #2826, do you mind adding a tutorial for using this API like this one https://github.com/opensearch-project/ml-commons/blob/main/docs/tutorials/conversational_search/conversational_search_with_Cohere_Command.md, and update the limitations specified in this tutorial? I am 100% sure that the requester will follow the tutorial to build solutions.

Zhangxunmt · 2024-08-29T17:54:44Z

...opensearch/searchpipelines/questionanswering/generative/ext/GenerativeQAParametersTests.java

    static class DummyStreamOutput extends StreamOutput {

        List<String> list = new ArrayList<>();
        List<Integer> intValues = new ArrayList<>();

        @Override
        public void writeString(String str) {
+            System.out.println("Adding string: " + str);


a leftover? remove this line?

It's just for debugging. This is only in a test.

Zhangxunmt · 2024-08-29T18:10:30Z

...ain/java/org/opensearch/searchpipelines/questionanswering/generative/llm/DefaultLlmImpl.java

@@ -136,6 +136,19 @@ protected Map<String, String> getInputParameters(ChatCompletionInput chatComplet
                            chatCompletionInput.getContexts()
                        )
                );
+        } else if (chatCompletionInput.getModelProvider() == ModelProvider.BEDROCK_CONVERSE) {


Instead of keep adding new "else if" blocks here, why not just define an annotation for different methods and use reflection to process different models servers at runtime? In that way I think we could reduce the code complexity here.

Given the time constraint, I would do the refactoring at a later time. I have been thinking about a good way to avoid this style of handling different cases of LLM vendors and APIs, but I was waiting for some general patterns to emerge which I think is this Message API, but again there are still small differences between LLM providers.

austintlee · 2024-08-29T20:03:19Z

@austintlee just a few comments left but LGTM overall. Since this new API support will be used by the requester in #2826, do you mind adding a tutorial for using this API like this one https://github.com/opensearch-project/ml-commons/blob/main/docs/tutorials/conversational_search/conversational_search_with_Cohere_Command.md, and update the limitations specified in this tutorial? I am 100% sure that the requester will follow the tutorial to build solutions.

Yes, I'll produce some documentation, but I don't have time this week. You can look at the IT tests to get a sense of the syntax for now if that helps with reviewing and making sense of the code.

ylwu-amzn · 2024-08-30T20:02:17Z

...main/java/org/opensearch/searchpipelines/questionanswering/generative/prompt/PromptUtil.java

        }

-        JsonArray messageArray = new JsonArray();
+        MessageArrayBuilder bldr = new MessageArrayBuilder(provider);


bldr is hard to understand , use more meaningful variable name?

austintlee · 2024-08-30T23:04:42Z

  {"error":{"root_cause":[{"type":"status_exception","reason":"Error from remote service: {\"message\":\"The security token included in the request is invalid.\"}"}],"type":"status_exception","reason":"Error from remote service: {\"message\":\"The security token included in the request is invalid.\"}"},"status":403}

austintlee · 2024-08-30T23:04:51Z

Is this an issue with GH setup?

mingshl · 2024-09-05T20:35:01Z

plugin/src/test/resources/org/opensearch/ml/rest/test_data/openai_boardwalk.jpg

Do we need license for the picture or pdf in this PR? @kolchfa-aws

@austintlee is this image generated from openai image generation?

@austintlee Could you tell me the original source of this image please?

FYI, that image is from OpenAI's API documentation - https://platform.openai.com/docs/api-reference/chat/create. It's in the example they give for including images in chat completion.

The actual source is Wikipedia.

Zhangxunmt · 2024-09-06T01:04:41Z

@austintlee any objection to merge the PR if the issue is only in the test itself? Let’s use a separate PR to track the IT issue.

austintlee · 2024-09-06T01:35:01Z

RestBedRockInferenceIT > test_bedrock_multimodal_model FAILED
    org.opensearch.client.ResponseException: method [POST], host [http://[::1]:38749], URI [/_plugins/_ml/models/null/_deploy], status line [HTTP/1.1 404 Not Found]
    {"error":{"root_cause":[{"type":"status_exception","reason":"Failed to find model"}],"type":"status_exception","reason":"Failed to find model"},"status":404}

This is not my test. Is this a known issue? Some (internal) model got removed? Am I the only one seeing this issue?

austintlee · 2024-09-06T01:35:53Z

Also, all these time-outs we were seeing - I don't see this happening on Windows or Mac. Should I try to repro it on linux? It's really frustrating that I can't repro it locally.

Zhangxunmt · 2024-09-06T01:42:29Z

Also, all these time-outs we were seeing - I don't see this happening on Windows or Mac. Should I try to repro it on linux? It's really frustrating that I can't repro it locally.

That test flaky is because of the congestion in the ITs running environment. We should remove some outdated or duplicate ITs to unchoke. That multi modal IT should update to use auto deploy so it won’t fail due to model not found because it will be auto deployed in that case.

austintlee · 2024-09-06T03:27:42Z

Can we merge this?

opensearch-trigger-bot · 2024-09-06T16:30:57Z

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.x 2.x
# Navigate to the new working tree
cd .worktrees/backport-2.x
# Create a new branch
git switch --create backport/backport-2851-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 17e81ae618bebc72b8d3cb76d57f1556b1d8c8e1
# Push it to GitHub
git push --set-upstream origin backport/backport-2851-to-2.x
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-2851-to-2.x.

opensearch-trigger-bot · 2024-09-06T16:30:58Z

The backport to 2.17 failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.17 2.17
# Navigate to the new working tree
cd .worktrees/backport-2.17
# Create a new branch
git switch --create backport/backport-2851-to-2.17
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 17e81ae618bebc72b8d3cb76d57f1556b1d8c8e1
# Push it to GitHub
git push --set-upstream origin backport/backport-2851-to-2.17
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.17

Then, create a pull request where the base branch is 2.17 and the compare/head branch is backport/backport-2851-to-2.17.

…3.5 Sonnet) (opensearch-project#2851) * Add support for Anthropic Message API (Issue 2826) Signed-off-by: Austin Lee <austin@aryn.ai> * Fix a bug. Signed-off-by: Austin Lee <austin@aryn.ai> * Add unit tests, improve coverage, clean up code. Signed-off-by: Austin Lee <austin@aryn.ai> * Allow pdf and jpg files for IT tests for multimodel conversation API testing. Signed-off-by: Austin Lee <austin@aryn.ai> * Fix spotless check issues. Signed-off-by: Austin Lee <austin@aryn.ai> * Update IT to work with session tokens. Signed-off-by: Austin Lee <austin@aryn.ai> * Fix MLRAGSearchProcessorIT not to extend RestMLRemoteInferenceIT. Signed-off-by: Austin Lee <austin@aryn.ai> * Use suite specific model group name. Signed-off-by: Austin Lee <austin@aryn.ai> * Disable tests that require futher investigation. Signed-off-by: Austin Lee <austin@aryn.ai> * Skip two additional tests with time-outs. Signed-off-by: Austin Lee <austin@aryn.ai> * Restore a change from RestMLRemoteInferenceIT. Signed-off-by: Austin Lee <austin@aryn.ai> --------- Signed-off-by: Austin Lee <austin@aryn.ai> (cherry picked from commit 17e81ae)

austintlee · 2024-09-06T23:46:24Z

I opened #2914 to bring back the rest of the tests.

austintlee · 2024-09-07T00:40:34Z

Doc - opensearch-project/documentation-website#8195. I don't have a PR, yet. Do I need to prepare a PR or is it someone from the docs team?

…3.5 Sonnet) (#2851) (#2912) * Add support for Anthropic Message API (Issue 2826) Signed-off-by: Austin Lee <austin@aryn.ai> * Fix a bug. Signed-off-by: Austin Lee <austin@aryn.ai> * Add unit tests, improve coverage, clean up code. Signed-off-by: Austin Lee <austin@aryn.ai> * Allow pdf and jpg files for IT tests for multimodel conversation API testing. Signed-off-by: Austin Lee <austin@aryn.ai> * Fix spotless check issues. Signed-off-by: Austin Lee <austin@aryn.ai> * Update IT to work with session tokens. Signed-off-by: Austin Lee <austin@aryn.ai> * Fix MLRAGSearchProcessorIT not to extend RestMLRemoteInferenceIT. Signed-off-by: Austin Lee <austin@aryn.ai> * Use suite specific model group name. Signed-off-by: Austin Lee <austin@aryn.ai> * Disable tests that require futher investigation. Signed-off-by: Austin Lee <austin@aryn.ai> * Skip two additional tests with time-outs. Signed-off-by: Austin Lee <austin@aryn.ai> * Restore a change from RestMLRemoteInferenceIT. Signed-off-by: Austin Lee <austin@aryn.ai> --------- Signed-off-by: Austin Lee <austin@aryn.ai> (cherry picked from commit 17e81ae)

…3.5 Sonnet) (#2851) (#2913) * Add support for Anthropic Message API (Issue 2826) Signed-off-by: Austin Lee <austin@aryn.ai> * Fix a bug. Signed-off-by: Austin Lee <austin@aryn.ai> * Add unit tests, improve coverage, clean up code. Signed-off-by: Austin Lee <austin@aryn.ai> * Allow pdf and jpg files for IT tests for multimodel conversation API testing. Signed-off-by: Austin Lee <austin@aryn.ai> * Fix spotless check issues. Signed-off-by: Austin Lee <austin@aryn.ai> * Update IT to work with session tokens. Signed-off-by: Austin Lee <austin@aryn.ai> * Fix MLRAGSearchProcessorIT not to extend RestMLRemoteInferenceIT. Signed-off-by: Austin Lee <austin@aryn.ai> * Use suite specific model group name. Signed-off-by: Austin Lee <austin@aryn.ai> * Disable tests that require futher investigation. Signed-off-by: Austin Lee <austin@aryn.ai> * Skip two additional tests with time-outs. Signed-off-by: Austin Lee <austin@aryn.ai> * Restore a change from RestMLRemoteInferenceIT. Signed-off-by: Austin Lee <austin@aryn.ai> --------- Signed-off-by: Austin Lee <austin@aryn.ai> (cherry picked from commit 17e81ae)

kolchfa-aws · 2024-09-09T16:20:51Z

@austintlee Our workflow is that the feature developer puts up a documentation PR in the doc repo and then the doc team reviews. Please feel free to ask any questions. Thank you!

… Claude 3.5 Sonnet) (opensearch-project#2851) (opensearch-project#2913)" This reverts commit ed37690.

… Claude 3.5 Sonnet) (#2851) (#2913)" (#2929) This reverts commit ed37690.

austintlee requested review from b4sjoo, dhrubo-os, jngz-es, model-collapse, rbhavna, ylwu-amzn, zane-neo, Zhangxunmt, HenryL27, samuel-oci and xinyual as code owners August 28, 2024 05:17

austintlee had a problem deploying to ml-commons-cicd-env August 28, 2024 05:17 — with GitHub Actions Failure

Zhangxunmt reviewed Aug 29, 2024

View reviewed changes

Zhangxunmt previously approved these changes Aug 30, 2024

View reviewed changes

ylwu-amzn reviewed Aug 30, 2024

View reviewed changes

austintlee had a problem deploying to ml-commons-cicd-env August 30, 2024 21:19 — with GitHub Actions Failure

austintlee dismissed Zhangxunmt’s stale review via 239a695 August 30, 2024 21:48

austintlee had a problem deploying to ml-commons-cicd-env August 30, 2024 21:48 — with GitHub Actions Failure

austintlee had a problem deploying to ml-commons-cicd-env August 30, 2024 22:13 — with GitHub Actions Failure

austintlee had a problem deploying to ml-commons-cicd-env August 30, 2024 23:27 — with GitHub Actions Failure

austintlee had a problem deploying to ml-commons-cicd-env August 30, 2024 23:28 — with GitHub Actions Failure

mingshl reviewed Sep 5, 2024

View reviewed changes

austintlee had a problem deploying to ml-commons-cicd-env September 5, 2024 23:28 — with GitHub Actions Failure

Zhangxunmt approved these changes Sep 6, 2024

View reviewed changes

austintlee had a problem deploying to ml-commons-cicd-env September 6, 2024 01:43 — with GitHub Actions Failure

austintlee had a problem deploying to ml-commons-cicd-env September 6, 2024 04:16 — with GitHub Actions Failure

austintlee temporarily deployed to ml-commons-cicd-env September 6, 2024 05:48 — with GitHub Actions Inactive

austintlee temporarily deployed to ml-commons-cicd-env September 6, 2024 06:42 — with GitHub Actions Inactive

ylwu-amzn approved these changes Sep 6, 2024

View reviewed changes

ylwu-amzn added backport 2.x backport 2.17 labels Sep 6, 2024

ylwu-amzn merged commit 17e81ae into opensearch-project:main Sep 6, 2024
9 checks passed

austintlee mentioned this pull request Sep 6, 2024

Add support for Bedrock Converse API (Anthropic Messages API, Claude … #2912

Merged

5 tasks

austintlee mentioned this pull request Sep 6, 2024

Add support for Bedrock Converse API (Anthropic Messages API, Claude … #2913

Merged

5 tasks

Zhangxunmt added a commit to Zhangxunmt/ml-commons that referenced this pull request Sep 10, 2024

Revert "Add support for Bedrock Converse API (Anthropic Messages API,…

a0b79e8

… Claude 3.5 Sonnet) (opensearch-project#2851) (opensearch-project#2913)" This reverts commit ed37690.

Zhangxunmt mentioned this pull request Sep 10, 2024

Revert "Add support for Bedrock Converse API (Anthropic Messages API,… #2929

Merged

5 tasks

Zhangxunmt added a commit that referenced this pull request Sep 10, 2024

Revert "Add support for Bedrock Converse API (Anthropic Messages API,…

0135cb9

… Claude 3.5 Sonnet) (#2851) (#2913)" (#2929) This reverts commit ed37690.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for Bedrock Converse API (Anthropic Messages API, Claude 3.5 Sonnet) #2851

Add support for Bedrock Converse API (Anthropic Messages API, Claude 3.5 Sonnet) #2851

austintlee commented Aug 28, 2024

austintlee commented Aug 28, 2024

Zhangxunmt commented Aug 29, 2024

Zhangxunmt Aug 29, 2024

austintlee Aug 30, 2024

Zhangxunmt Aug 29, 2024

austintlee Aug 29, 2024

austintlee commented Aug 29, 2024

ylwu-amzn Aug 30, 2024

austintlee Aug 30, 2024

austintlee commented Aug 30, 2024

austintlee commented Aug 30, 2024

mingshl Sep 5, 2024 •

edited

Loading

mingshl Sep 5, 2024

kolchfa-aws Sep 5, 2024

austintlee Sep 5, 2024

austintlee Sep 5, 2024

Zhangxunmt commented Sep 6, 2024

austintlee commented Sep 6, 2024

austintlee commented Sep 6, 2024

Zhangxunmt commented Sep 6, 2024

austintlee commented Sep 6, 2024

opensearch-trigger-bot bot commented Sep 6, 2024

opensearch-trigger-bot bot commented Sep 6, 2024

austintlee commented Sep 6, 2024

austintlee commented Sep 7, 2024

kolchfa-aws commented Sep 9, 2024

Add support for Bedrock Converse API (Anthropic Messages API, Claude 3.5 Sonnet) #2851

Add support for Bedrock Converse API (Anthropic Messages API, Claude 3.5 Sonnet) #2851

Conversation

austintlee commented Aug 28, 2024

Description

Related Issues

Check List

austintlee commented Aug 28, 2024

Zhangxunmt commented Aug 29, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

austintlee commented Aug 29, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

austintlee commented Aug 30, 2024

austintlee commented Aug 30, 2024

mingshl Sep 5, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Zhangxunmt commented Sep 6, 2024

austintlee commented Sep 6, 2024

austintlee commented Sep 6, 2024

Zhangxunmt commented Sep 6, 2024

austintlee commented Sep 6, 2024

opensearch-trigger-bot bot commented Sep 6, 2024

opensearch-trigger-bot bot commented Sep 6, 2024

austintlee commented Sep 6, 2024

austintlee commented Sep 7, 2024

kolchfa-aws commented Sep 9, 2024

mingshl Sep 5, 2024 •

edited

Loading