Adds Places aggregation #795

Merged: 2 commits merged into main from gdt-168-aggregations on Feb 22, 2024

Conversation


@JPrevost JPrevost commented Feb 16, 2024

NOTE: test/controllers/graphql_controller_v2_test.rb has significant changes, and GitHub's code review window will not show them by default. Please make sure to review those changes. The cassettes are less interesting; you can probably just confirm that one or two indeed scrub the sensitive data and ignore the rest, as it was all automated.

Why are these changes being introduced:

  • geo ui has requested the ability to filter on additional fields

Relevant ticket(s):

  • https://mitlibraries.atlassian.net/browse/GDT-168
  • https://mitlibraries.atlassian.net/browse/GDT-169

How does this address that need:

  • Adds aggregation in OpenSearch for fixed date_ranges
  • Adds aggregation in OpenSearch for places by utilizing Subjects with a kind of Dublin Core; Spatial (a rough sketch of the aggregation shape follows below)
  • Adds GraphQL aggregation capabilities for date_ranges and places
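
For orientation, the places aggregation boils down to an OpenSearch terms aggregation over Subjects whose kind is Dublin Core; Spatial, alongside a fixed date_range aggregation. The sketch below is illustrative only: the field paths, bucket boundaries, and sizes are assumptions, not the actual definitions in app/models/aggregations.rb.

```ruby
# Illustrative sketch only. Field paths ("subjects.kind", "subjects.value.keyword",
# "dates.value") and the bucket boundaries are assumptions, not the real
# definitions in app/models/aggregations.rb. If subjects is a nested mapping,
# the places aggregation would additionally need to be wrapped in a nested agg.
def example_aggregations
  {
    places: {
      # Restrict to spatial subjects, then bucket the subject values.
      filter: { term: { 'subjects.kind' => 'Dublin Core; Spatial' } },
      aggs: {
        place_names: {
          terms: { field: 'subjects.value.keyword', size: 100 }
        }
      }
    },
    date_ranges: {
      # Fixed, non-overlapping date buckets.
      date_range: {
        field: 'dates.value',
        ranges: [
          { key: 'before 1900', to: '1900-01-01' },
          { key: '1900 to 1999', from: '1900-01-01', to: '2000-01-01' },
          { key: '2000 to present', from: '2000-01-01' }
        ]
      }
    }
  }
end
```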

Not included

  • Aggregations for Access Type and Data Type are not yet included as we are still working out exactly how that data should be stored

Document any side effects to this change:

The default index we used in tests was changed from the old value to all-current, which should make it easier to generate new tests and cassettes in the future, as the old value (timdex-prod) was no longer used anywhere but in our tests.
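
The mechanism for that default isn't shown in this description, but the usual pattern is an environment variable with a fallback. The variable name below is a guess for illustration only; the real name and default are documented in the README.

```ruby
# Hypothetical sketch; check the README for the real variable name and default.
def default_index
  ENV.fetch('OPENSEARCH_INDEX', 'all-current')
end
```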

Changing aggregations required rebuilding most of our cassettes. Since I was rebuilding most of them anyway, I opted to rebuild them all. This includes automatic sensitive data scrubbing when using AWS OpenSearch instances (scrubs credentials and the URI of the instance). The README has instructions on how to generate cassettes successfully.
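
The exact scrubbing setup lives in the test configuration; as a rough illustration (the placeholder strings and ENV variable names below are assumptions, not necessarily this project's), VCR's filter_sensitive_data hook is the standard way to do this kind of redaction:

```ruby
# test_helper-style sketch; ENV names and placeholder strings are assumptions.
require 'vcr'

VCR.configure do |config|
  config.cassette_library_dir = 'test/vcr_cassettes'
  config.hook_into :webmock

  # Replace the OpenSearch endpoint and AWS credentials with placeholders so
  # recorded cassettes never contain sensitive values.
  config.filter_sensitive_data('<OPENSEARCH_URL>') { ENV['OPENSEARCH_URL'] }
  config.filter_sensitive_data('<AWS_ACCESS_KEY_ID>') { ENV['AWS_ACCESS_KEY_ID'] }
  config.filter_sensitive_data('<AWS_SECRET_ACCESS_KEY>') { ENV['AWS_SECRET_ACCESS_KEY'] }
end
```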

I generated cassettes from our current dev1 data, which means a lot of explicitly checked values changed in the tests. While updating tests for value changes, I also took this opportunity to remove most of the remnants of the v2 naming we had used during the transition from v1. I have not yet renamed the test file, as I was worried that would make the review more complicated. A follow-on change that just renames that file will be forthcoming.

Developer

  • All new ENV is documented in README
  • All new ENV has been added to Heroku Pipeline, Staging and Prod
  • ANDI or Wave has been run in accordance with
    our guide and
    all issues introduced by these changes have been resolved or opened as new
    issues (link to those issues in the Pull Request details above)
  • Stakeholder approval has been confirmed (or is not needed)

Code Reviewer

  • The commit message is clear and follows our guidelines
    (not just this pull request message)
  • There are appropriate tests covering any new functionality
  • The documentation has been updated or is unnecessary
  • The changes have been verified
  • New dependencies are appropriate or there were no changes

Requires database migrations?

NO

Includes new or updated dependencies?

NO

@mitlib mitlib temporarily deployed to timdex-pr-795 February 16, 2024 19:14 Inactive
@matt-bernhardt matt-bernhardt self-assigned this Feb 16, 2024
@matt-bernhardt matt-bernhardt left a comment

Some of these are questions to make sure I'm not misunderstanding something, but there's at least one test (the eastern vs western hemisphere box test) which I think needs to be tweaked in order to still be valid. Or maybe the test itself isn't meaningful, as it relies on OpenSearch working correctly rather than anything we're actually doing in the application. Regardless, the change as currently proposed seems to allow the test to pass incorrectly.

I also have a concern about the year bucketing being applied, because the most recent bucket seems inconsistent with the rest of them.

As you mentioned in the PR, I didn't look at each cassette individually - but did inspect a few of the ones that changed to see how the anonymization was being applied.

I'm happy to chat more about this on Tuesday, though - you're right that there was a lot involved in regenerating all these cassettes, and I'm glad that you took this on.

Resolved review threads on app/models/aggregations.rb (outdated) and test/models/opensearch_test.rb.
    post '/graphql', params: { query: '{
      search(geobox: {
        minLongitude: 0,
        minLatitude: -90,
        maxLongitude: 180,
        maxLatitude: 90
-     }) {
+     },
+     source: "MIT GIS Resources") {
@matt-bernhardt (Member) commented:

I think I understand why we're adding source parameters to this search: now that the tests run against a bigger index, a source filter is needed to get down to only GIS records. But if that is the case, the same parameter should also be added to the cassette on line 343 to keep them in sync. Otherwise the different results asserted on line 369 could be due to different source values rather than different geobox definitions.

@JPrevost (Member Author) replied:

Good catch. I thought I had done that but clearly didn't. I'll fix.

The reason I am making this change is solely because the test failed with the larger index: the hits for both queries came back as 10,000, so I limited to just MIT data... poorly :)

Will fix.
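
For background on why both counts plateaued at 10,000: OpenSearch only tracks hits.total exactly up to 10,000 by default; beyond that it reports 10,000 as a lower bound unless the query asks for an exact count. The snippet below is just a sketch of the relevant request option, not something this PR changes.

```ruby
# Sketch only: shows the OpenSearch option, not code from this PR.
query_body = {
  track_total_hits: true, # count all matches instead of capping hits.total at 10,000
  query: { match_all: {} }
}
```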

@JPrevost (Member Author) replied:

I've adjusted this test so both hemispheres use the same source filter. I've also reconfigured it to use a single cassette for the whole test, as we can instruct the cassette to hold responses for all of the queries that take place during that test, which is generally a bit easier to maintain.
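
As a rough illustration of that single-cassette pattern (the test name, cassette name, and GraphQL fields below are placeholders, not the actual code in test/controllers/graphql_controller_v2_test.rb), one VCR.use_cassette block can record and replay every request made inside it; matching on the request body keeps the two GraphQL POSTs distinct:

```ruby
# Placeholder test/cassette names; the GraphQL query fields are assumptions too.
test 'graphql geobox search differs by hemisphere' do
  eastern = '{ search(geobox: { minLongitude: 0, minLatitude: -90, maxLongitude: 180, maxLatitude: 90 }, source: "MIT GIS Resources") { hits } }'
  western = '{ search(geobox: { minLongitude: -180, minLatitude: -90, maxLongitude: 0, maxLatitude: 90 }, source: "MIT GIS Resources") { hits } }'

  # One cassette holds the responses for both POSTs; matching on the request
  # body keeps them distinct when the cassette is replayed.
  VCR.use_cassette('graphql geobox hemispheres',
                   match_requests_on: %i[method uri body]) do
    post '/graphql', params: { query: eastern }
    eastern_hits = response.parsed_body.dig('data', 'search', 'hits')

    post '/graphql', params: { query: western }
    western_hits = response.parsed_body.dig('data', 'search', 'hits')

    refute_equal(eastern_hits, western_hits)
  end
end
```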

@@ -367,8 +369,8 @@ def setup
      refute_equal(eastern_hits, western_hits)
    end

-   test 'graphqlv2 geobox search with keyword search' do
-     VCR.use_cassette('graphqlv2 geobox with keyword') do
+   test 'graphql geobox search with keyword search' do
@matt-bernhardt (Member) commented:

Same question for this test as the other geospatial tests - should we be adding the source parameter here?

@JPrevost (Member Author) replied:

We shouldn't need to in this case because our assertions don't care if there are more than 10,000 results.

@@ -398,9 +400,9 @@ def setup
      end
    end

-   test 'graphqlv2 geobox search with geodistance search' do
+   test 'graphql geobox search with geodistance search' do
@matt-bernhardt (Member) commented:

Same question for this test as the other geospatial tests - should we be adding the source parameter here?

@JPrevost (Member Author) replied:

We shouldn't need to in this case because our assertions don't care if there are more than 10,000 results.

@mitlib mitlib temporarily deployed to timdex-pr-795 February 20, 2024 19:47 Inactive
@JPrevost (Member Author) commented:

@matt-bernhardt I think I have addressed your feedback and this is ready for another review. Thanks!

@JPrevost JPrevost changed the title Adds additional aggregations Adds Places aggregation Feb 22, 2024
@matt-bernhardt matt-bernhardt left a comment

I think this looks good now - thanks. I reviewed all the tests in the graphql controller file, and I think I get it all.

Why are these changes being introduced:

* geo ui has requested the ability to filter on additional fields

Relevant ticket(s):

* https://mitlibraries.atlassian.net/browse/GDT-168
* https://mitlibraries.atlassian.net/browse/GDT-169

How does this address that need:

* Adds aggregation in OpenSearch for places by utilizing `Subjects` with
  a `kind` of `Dublin Core; Spatial`

Not included

* Aggregations for `Access Type` and `Data Type` are not yet included
  as we are still working out exactly how that data should be stored, as
  well as `years`, which will also be handled separately

Document any side effects to this change:

The default index we used in tests was changed from the old value to
`all-current` which should make it easier to generate new tests and
cassettes in the future as the old value (`timdex-prod`) was no longer
used anywhere but in our tests.

Changing aggregations required rebuilding most of our cassettes. Since I
was rebuilding most of them anyway, I opted to rebuild them all. This
includes automatic sensitive data scrubbing when using AWS OpenSearch
instances (scrubs credentials and the URI of the instance). The README
has instructions on how to generate cassettes successfully.

I generated cassettes from our current dev1 data, which means a lot of
explicitly checked values changed in the tests. While updating tests
for value changes, I also took this opportunity to remove most of the
remnants of the v2 naming we had used during the transition from v1.
I have not yet renamed the test file, as I was worried that would make
the review more complicated. A follow-on change that just renames that
file will be forthcoming.
@JPrevost JPrevost force-pushed the gdt-168-aggregations branch from 9c46914 to 3102332 on February 22, 2024 19:56
@JPrevost JPrevost temporarily deployed to timdex-pr-795 February 22, 2024 19:56 Inactive
@JPrevost JPrevost merged commit 6feb507 into main Feb 22, 2024
3 of 4 checks passed
@JPrevost JPrevost deleted the gdt-168-aggregations branch February 22, 2024 20:11