-
Notifications
You must be signed in to change notification settings - Fork 61
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: provide saner semantics around aws_session_token #1295
Conversation
feat: aws credentials bug is working. https://gloo-global.slack.com/archives/C03KV1PJ6EM/p1736215459393779 new logic if we don't set any of the credentials manually in the clients.baml, playground, or client registry then we will fallback to use the environment variables. Otherwise, we have to set all the environment variables in order to take them in. remove runtest
The latest updates on your projects. Learn more about Vercel for Git ↗︎
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 Looks good to me! Reviewed everything up to 3fa4835 in 1 minute and 42 seconds
More details
- Looked at
2869
lines of code in37
files - Skipped
2
files when reviewing. - Skipped posting
1
drafted comments based on config settings.
1. engine/baml-lib/llm-client/src/clients/aws_bedrock.rs:205
- Draft comment:
Consider refactoring the repeated pattern of fetching environment variables into a helper function to reduce redundancy and improve maintainability. - Reason this comment was not posted:
Confidence changes required:50%
The code in aws_bedrock.rs has a repeated pattern for fetching environment variables. This can be refactored to a helper function to reduce redundancy and improve maintainability.
Workflow ID: wflow_0WDxqhPg3pw13jFk
You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet
mode, and more.
(None, None, None) => { | ||
// If no credentials provided, get them all from env vars | ||
let access_key_id = match ctx.get_env_var("AWS_ACCESS_KEY_ID") { | ||
Ok(key) if !key.is_empty() => Some(key), | ||
_ => None, | ||
}; | ||
let secret_access_key = match ctx.get_env_var("AWS_SECRET_ACCESS_KEY") { | ||
Ok(key) if !key.is_empty() => Some(key), | ||
_ => None, | ||
}; | ||
let session_token = match ctx.get_env_var("AWS_SESSION_TOKEN") { | ||
Ok(token) if !token.is_empty() => Some(token), | ||
_ => None, | ||
}; | ||
(access_key_id, secret_access_key, session_token) | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@seawatts i suspect this can be collapsed as
(None, None, None) => { | |
// If no credentials provided, get them all from env vars | |
let access_key_id = match ctx.get_env_var("AWS_ACCESS_KEY_ID") { | |
Ok(key) if !key.is_empty() => Some(key), | |
_ => None, | |
}; | |
let secret_access_key = match ctx.get_env_var("AWS_SECRET_ACCESS_KEY") { | |
Ok(key) if !key.is_empty() => Some(key), | |
_ => None, | |
}; | |
let session_token = match ctx.get_env_var("AWS_SESSION_TOKEN") { | |
Ok(token) if !token.is_empty() => Some(token), | |
_ => None, | |
}; | |
(access_key_id, secret_access_key, session_token) | |
} | |
(None, None, None) => { | |
// If no credentials provided, get them all from env vars | |
(ctx.get_env_var("AWS_ACCESS_KEY_ID").ok(), ctx.get_env_var("AWS_SECRET_ACCESS_KEY").ok(), ctx.get_env_var("AWS_SESSION_TOKEN").ok()) | |
} |
but i don't want to put this through another round of testing right now so i'm leaving it as-is
(i'm not sure if the empty string difference is particularly significant here, i think it just triggers the fallback-to-env-var logic twice via two different paths
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 Looks good to me! Reviewed everything up to 8199507 in 17 seconds
More details
- Looked at
182
lines of code in2
files - Skipped
0
files when reviewing. - Skipped posting
2
drafted comments based on config settings.
1. engine/baml-lib/llm-client/src/clients/aws_bedrock.rs:205
- Draft comment:
The logic for loading AWS credentials has been updated to ensure that if any of the credentials (access_key_id, secret_access_key, session_token) are set, they are used explicitly, and if none are set, they are loaded from environment variables. This change aligns with the behavior of AWS SDKs. - Reason this comment was not posted:
Confidence changes required:10%
The code inaws_bedrock.rs
andaws_client.rs
has been updated to change the logic for loading AWS credentials. The new logic ensures that if any of the credentials (access_key_id, secret_access_key, session_token) are set, they are used explicitly, and if none are set, they are loaded from environment variables. This change aligns with the behavior of AWS SDKs.
2. engine/baml-runtime/src/internal/llm_client/primitive/aws/aws_client.rs:149
- Draft comment:
The logic for loading AWS credentials has been updated to ensure that if any of the credentials (access_key_id, secret_access_key, session_token) are set, they are used explicitly, and if none are set, they are loaded from environment variables. This change aligns with the behavior of AWS SDKs. - Reason this comment was not posted:
Confidence changes required:10%
The code inaws_client.rs
has been updated to change the logic for loading AWS credentials. The new logic ensures that if any of the credentials (access_key_id, secret_access_key, session_token) are set, they are used explicitly, and if none are set, they are loaded from environment variables. This change aligns with the behavior of AWS SDKs.
Workflow ID: wflow_LPzSp0Hb5O6Ei43Z
You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet
mode, and more.
Change aws credential loading logic: - if any of `access_key_id`, `secret_access_key`, or `session_token` are set, all 3 are loaded explicitly (either from the `.baml` client definition or the dynamic client properties) - if, and only if, none of the 3 are set, all 3 are loaded from, respectively, `AWS_ACCESS_KEY_ID` `AWS_SECRET_ACCESS_KEY` `AWS_SESSION_TOKEN` This most closely matches the behavior of the AWS SDKs (Python, TS, and Rust). See [slack thread](https://gloo-global.slack.com/archives/C03KV1PJ6EM/p1736215459393779) which is copied below: > OK, so chris and i figured out what happened with bedrock/ethan: > > in #1266, chris correctly added support for aws session token so that if a user set it in aws.baml as properties { session_token env.AWS_SESSION_TOKEN }, baml would respect that (prior to #1266 baml would not) > > - however, 1266 also introduced an implicit default: if AWS_SESSION_TOKEN is set in the process' environment, but the user only set properties { access_key_id ... ; secret_access_key ... ; } then baml would construct the aws creds using access_key_id secret_access_key and session_token > - in ethan's case this is problematic, because he uses custom values for the access_key_id secret_access_key pair from his lambda secrets, but the aws lambda environment also sets AWS_ACCESS_KEY_ID AWS_SECRET_ACCESS_KEY AWS_SESSION_TOKEN > - as a result, after updating past #1266, his (access_key_id, secret_access_key) did not agree with his AWS_SESSION_TOKEN and caused a runtime failure > - he also has no way to opt out of this behavior, because we do not currently provide a way to force session_token to null: it is always inferred from the environment by default > > to solve this, we're going to use the following logic for aws creds: > - if any of access_key_id, secret_access_key, or session_token are set in baml client properties, we will never magically infer a value for any of the 3 from the environment > - but if none of the 3 are set, we will read all 3 from the env > - this behavior feels most in line with how credential init in the aws sdk normally works > - in ts, an AwsCredentialIdentityProvider is any function that returns { accessKeyId: string, secretAccessKey: string, sessionToken?: string, expiration?: Date} (docs) > - in python, if you set any of the 3, a creds object is constructed using the explicitly provided values of all 3 (impl callsite, impl source code) > - in rust, this is what constructing Credentials::new does when you override the credentials_loader > > stepping back, this is a mix of the two approaches: (1) session_token: unset defaults to reading AWS_SESSION_TOKEN from the env, and user is allowed to explicitly set session_token: null or (2) session_token: unset never reads from the env, and the user must always set it > > NB: this does not explain why multiple other customers are complaining about not being able to figure out how to use aws bedrock. so @Vaibhav Gupta we still need to see what those other complaints are <!-- ELLIPSIS_HIDDEN --> ---- > [!IMPORTANT] > Updates AWS credential loading logic to use explicit credentials if set, otherwise defaults to environment variables, aligning with AWS SDK behavior. > > - **Behavior**: > - Updates AWS credential loading logic in `aws_bedrock.rs` and `aws_client.rs`. > - If any of `access_key_id`, `secret_access_key`, or `session_token` are set, all are loaded explicitly. > - If none are set, all are loaded from environment variables `AWS_ACCESS_KEY_ID`, `AWS_SECRET_ACCESS_KEY`, `AWS_SESSION_TOKEN`. > - **Error Handling**: > - Adds error checks for environment variable placeholders in `aws_client.rs`. > - **Misc**: > - Adjusts credential provider logic in `aws_client.rs` to use `DefaultCredentialsChain` when no credentials are provided. > > <sup>This description was created by </sup>[<img alt="Ellipsis" src="https://img.shields.io/badge/Ellipsis-blue?color=175173">](https://www.ellipsis.dev?ref=BoundaryML%2Fbaml&utm_source=github&utm_medium=referral)<sup> for 8199507. It will automatically update as commits are pushed.</sup> <!-- ELLIPSIS_HIDDEN --> --------- Co-authored-by: Chris Watts <chris.watts.t@gmail.com>
Main bug fix #1295 These are the integ-tests changes <!-- ELLIPSIS_HIDDEN --> ---- > [!IMPORTANT] > Split integration tests into provider-specific files, add dynamic types and error handling tests, and update configuration for comprehensive testing. > > - **Integration Tests**: > - Split integration tests into provider-specific files, including `anthropic.test.ts`, `aws.test.ts`, `azure.test.ts`, `gemini.test.ts`, `openai.test.ts`, and `vertex.test.ts`. > - Add tests for dynamic types, error handling, input/output, recursive types, and tracing in `dynamic-types.test.ts`, `error-handling.test.ts`, `input-output.test.ts`, `recursive-types.test.ts`, and `tracing.test.ts`. > - Implement streaming tests for providers like AWS, Azure, Gemini, and OpenAI. > - **Configuration**: > - Update `jest.config.js` to use `ts-jest` and set up test environment and module mappings. > - Modify `package.json` scripts to include new test commands and dependencies. > - **Error Handling**: > - Add tests to handle invalid argument types, client configuration errors, and HTTP errors in `error-handling.test.ts`. > - **Dynamic Client Registry**: > - Test dynamic client configuration and credential resolution for AWS in `aws.test.ts`. > > <sup>This description was created by </sup>[<img alt="Ellipsis" src="https://img.shields.io/badge/Ellipsis-blue?color=175173">](https://www.ellipsis.dev?ref=BoundaryML%2Fbaml&utm_source=github&utm_medium=referral)<sup> for 1d7b065. It will automatically update as commits are pushed.</sup> <!-- ELLIPSIS_HIDDEN --> --------- Co-authored-by: Chris Watts <chris.watts.t@gmail.com>
Main bug fix #1295 These are the integ-tests changes <!-- ELLIPSIS_HIDDEN --> ---- > [!IMPORTANT] > Split integration tests into provider-specific files, add dynamic types and error handling tests, and update configuration for comprehensive testing. > > - **Integration Tests**: > - Split integration tests into provider-specific files, including `anthropic.test.ts`, `aws.test.ts`, `azure.test.ts`, `gemini.test.ts`, `openai.test.ts`, and `vertex.test.ts`. > - Add tests for dynamic types, error handling, input/output, recursive types, and tracing in `dynamic-types.test.ts`, `error-handling.test.ts`, `input-output.test.ts`, `recursive-types.test.ts`, and `tracing.test.ts`. > - Implement streaming tests for providers like AWS, Azure, Gemini, and OpenAI. > - **Configuration**: > - Update `jest.config.js` to use `ts-jest` and set up test environment and module mappings. > - Modify `package.json` scripts to include new test commands and dependencies. > - **Error Handling**: > - Add tests to handle invalid argument types, client configuration errors, and HTTP errors in `error-handling.test.ts`. > - **Dynamic Client Registry**: > - Test dynamic client configuration and credential resolution for AWS in `aws.test.ts`. > > <sup>This description was created by </sup>[<img alt="Ellipsis" src="https://img.shields.io/badge/Ellipsis-blue?color=175173">](https://www.ellipsis.dev?ref=BoundaryML%2Fbaml&utm_source=github&utm_medium=referral)<sup> for 1d7b065. It will automatically update as commits are pushed.</sup> <!-- ELLIPSIS_HIDDEN --> --------- Co-authored-by: Chris Watts <chris.watts.t@gmail.com>
Change aws credential loading logic:
access_key_id
,secret_access_key
, orsession_token
are set, all 3 are loaded explicitly (either from the.baml
client definition or the dynamic client properties)AWS_ACCESS_KEY_ID
AWS_SECRET_ACCESS_KEY
AWS_SESSION_TOKEN
This most closely matches the behavior of the AWS SDKs (Python, TS, and Rust). See slack thread which is copied below:
Important
Updates AWS credential loading logic to use explicit credentials if set, otherwise defaults to environment variables, aligning with AWS SDK behavior.
aws_bedrock.rs
andaws_client.rs
.access_key_id
,secret_access_key
, orsession_token
are set, all are loaded explicitly.AWS_ACCESS_KEY_ID
,AWS_SECRET_ACCESS_KEY
,AWS_SESSION_TOKEN
.aws_client.rs
.aws_client.rs
to useDefaultCredentialsChain
when no credentials are provided.This description was created by for 8199507. It will automatically update as commits are pushed.