The reference App Identity implementations provide tools to cross-verify that the implementations are compatible. These tools should be used by external implementations for cross-verification as well. To cross-verify an implementation:
- Generate a test suite and verify it with one or more reference implementations.
- Generate a test suite using a reference implementation and verify it against the implementation under test.
For example, we run suite generation for each of the Elixir, Ruby, and Typescript implementations, and then verify each generated suite against teach implementation, including self-verification (running the Elixir suite against the Elixir implementation, etc.).
Each implementation has tooling to generate and run integration suites.
An App Identity integration test suite is a generated JSON file containing a
test Suite
. Simplified, it looks like this:
interface Suite {
name: string;
version: string;
description?: string;
spec_version: number;
tests: {
description: string;
app: {
id: number | string;
secret: string;
version: number;
config?: { fuzz?: number };
};
proof: string;
expect: "pass" | "fail";
required: boolean;
spec_version: number;
}[];
}
For more detail on how this is made available, see Integration Tooling.
Each conforming App Identity implementation is required to implement a suite generator (that generates the test suite JSON) and a suite runner (that parses and executes the test suite JSON).
The suite generator is a tool that uses a combination of the public API and internal testing tools to generate a test suite that will be verified by one or more other implementations. Conforming implementations must generate the tests described in Required Tests and should generate the tests described in Optional Tests.
The test description tables below are informative. The test descriptions in
required.yaml
and optional.yaml
are
normative, although suite implementations usually use the JSON
representations.
For more detail on the test description files, see Integration Tooling.
Required tests should be generated using only the public API of the implementation.
In the table below, if either Nonce
or Fuzz
are blank, the test uses the
default values as described in the spec. Otherwise, the included values must
be used.
Description | Expect | App | Proof | Nonce | Fuzz |
---|---|---|---|---|---|
App V1, Proof V1 | pass |
1 | 1 | ||
App V1, Proof V2 | pass |
1 | 2 | ||
App V1, Proof V3 | pass |
1 | 3 | ||
App V1, Proof V4 | pass |
1 | 4 | ||
App V2, Proof V2 | pass |
2 | 2 | ||
App V2, Proof V3 | pass |
2 | 3 | ||
App V2, Proof V4 | pass |
2 | 4 | ||
App V3, Proof V3 | pass |
3 | 3 | ||
App V3, Proof V4 | pass |
3 | 4 | ||
App V4, Proof V4 | pass |
4 | 4 | ||
App V1, Proof V2 (custom fuzz) | pass |
1 | 2 | 300 |
|
App V1, Proof V3 (custom fuzz) | pass |
1 | 3 | 300 |
|
App V1, Proof V4 (custom fuzz) | pass |
1 | 4 | 300 |
|
App V2, Proof V2 (custom fuzz) | pass |
2 | 2 | 300 |
|
App V2, Proof V3 (custom fuzz) | pass |
2 | 3 | 300 |
|
App V2, Proof V4 (custom fuzz) | pass |
2 | 4 | 300 |
|
App V3, Proof V3 (custom fuzz) | pass |
3 | 3 | 300 |
|
App V3, Proof V4 (custom fuzz) | pass |
3 | 4 | 300 |
|
App V4, Proof V4 (custom fuzz) | pass |
4 | 4 | 300 |
|
App V1, Proof V2 old timestamp | fail |
1 | 2 | 20060102T150405.333Z |
|
App V1, Proof V3 old timestamp | fail |
1 | 3 | 20060102T150405.333Z |
|
App V1, Proof V4 old timestamp | fail |
1 | 4 | 20060102T150405.333Z |
|
App V2, Proof V2 old timestamp | fail |
2 | 2 | 20060102T150405.333Z |
|
App V2, Proof V3 old timestamp | fail |
2 | 3 | 20060102T150405.333Z |
|
App V2, Proof V4 old timestamp | fail |
2 | 4 | 20060102T150405.333Z |
|
App V3, Proof V3 old timestamp | fail |
3 | 3 | 20060102T150405.333Z |
|
App V3, Proof V4 old timestamp | fail |
3 | 4 | 20060102T150405.333Z |
|
App V4, Proof V4 old timestamp | fail |
4 | 4 | 20060102T150405.333Z |
|
App V1, Proof V2 old timestamp (custom fuzz) | fail |
1 | 2 | 20060102T150405.333Z |
300 |
App V1, Proof V3 old timestamp (custom fuzz) | fail |
1 | 3 | 20060102T150405.333Z |
300 |
App V1, Proof V4 old timestamp (custom fuzz) | fail |
1 | 4 | 20060102T150405.333Z |
300 |
App V2, Proof V2 old timestamp (custom fuzz) | fail |
2 | 2 | 20060102T150405.333Z |
300 |
App V2, Proof V3 old timestamp (custom fuzz) | fail |
2 | 3 | 20060102T150405.333Z |
300 |
App V2, Proof V4 old timestamp (custom fuzz) | fail |
2 | 4 | 20060102T150405.333Z |
300 |
App V3, Proof V3 old timestamp (custom fuzz) | fail |
3 | 3 | 20060102T150405.333Z |
300 |
App V3, Proof V4 old timestamp (custom fuzz) | fail |
3 | 4 | 20060102T150405.333Z |
300 |
App V4, Proof V4 old timestamp (custom fuzz) | fail |
4 | 4 | 20060102T150405.333Z |
300 |
Required tests are also described in required.json, which could be used for code generation.
Optional tests require additional tooling (likely used in the implementation's
unit tests) in order to craft invalid payloads. There are two sets of optional
tests. All optional tests are expected to fail
.
The first set of tests are similar to the required fail
tests, but require the
construction of a custom timestamp nonce using a timestamp offset from the
current time.
Description | App | Proof | Nonce | Fuzz |
---|---|---|---|---|
App V1, Proof V2 offset timestamp | 1 | 2 | -11 minutes |
|
App V1, Proof V3 offset timestamp | 1 | 3 | -11 minutes |
|
App V1, Proof V4 offset timestamp | 1 | 4 | -11 minutes |
|
App V2, Proof V2 offset timestamp | 2 | 2 | -11 minutes |
|
App V2, Proof V3 offset timestamp | 2 | 3 | -11 minutes |
|
App V2, Proof V4 offset timestamp | 2 | 4 | -11 minutes |
|
App V3, Proof V3 offset timestamp | 3 | 3 | -11 minutes |
|
App V3, Proof V4 offset timestamp | 3 | 4 | -11 minutes |
|
App V4, Proof V4 offset timestamp | 4 | 4 | -11 minutes |
|
App V1, Proof V2 offset timestamp (custom fuzz) | 1 | 2 | -6 minutes |
300 |
App V1, Proof V3 offset timestamp (custom fuzz) | 1 | 3 | -6 minutes |
300 |
App V1, Proof V4 offset timestamp (custom fuzz) | 1 | 4 | -6 minutes |
300 |
App V2, Proof V2 offset timestamp (custom fuzz) | 2 | 2 | -6 minutes |
300 |
App V2, Proof V3 offset timestamp (custom fuzz) | 2 | 3 | -6 minutes |
300 |
App V2, Proof V4 offset timestamp (custom fuzz) | 2 | 4 | -6 minutes |
300 |
App V3, Proof V3 offset timestamp (custom fuzz) | 3 | 3 | -6 minutes |
300 |
App V3, Proof V4 offset timestamp (custom fuzz) | 3 | 4 | -6 minutes |
300 |
App V4, Proof V4 offset timestamp (custom fuzz) | 4 | 4 | -6 minutes |
300 |
The second set of tests require the explicit construction of bad payloads. Where the App version column is empty, the tests may be generated with V1 apps only, but it is recommended that all appropriate combinations be generated.
The tests described as Incorrect Proof ID
use a different id when generating
the padlock and proof than is generated for the included app. That is, if the
app id is provided as decafbad
, the padlock and proof might be generated with
deadbeef
. The exact values used do not matter, but must differ.
The tests described as Incorrect Secret
generate the padlock and proof with a
different secret than is generated for the included app. That is, if the app
secret is iaccepttherisk
, the padlock and proof might be generated with
myvoiceismypassword
. The exact values used do not matter, but must differ.
The tests described as Mismatched Padlock
generate the padlock with different
data than the proof. In our unit tests, we usually do this with a nonce value of
bad padlock
for building the padlock, and the normal nonce value when
generating the proof.
Description | App | Proof | Nonce |
---|---|---|---|
App V1, Proof V1, Empty Nonce | 1 | 1 | empty |
App V1, Proof V1, Bad Nonce | 1 | 1 | n:once |
Proof V2, Bad Nonce | 2 | 2006-01-02T15:04:05.333Z |
|
Proof V3, Bad Nonce | 3 | 2006-01-02T15:04:05.333Z |
|
Proof V4, Bad Nonce | 4 | 2006-01-02T15:04:05.333Z |
|
Proof V2, Non-Timestamp Nonce | 2 | nonce |
|
Proof V3, Non-Timestamp Nonce | 2 | nonce |
|
Proof V4, Non-Timestamp Nonce | 2 | nonce |
|
Proof V1, Incorrect Proof ID | 1 | ||
Proof V2, Incorrect Proof ID | 2 | ||
Proof V3, Incorrect Proof ID | 3 | ||
Proof V4, Incorrect Proof ID | 4 | ||
Proof V1, Incorrect Secret | 1 | ||
Proof V2, Incorrect Secret | 2 | ||
Proof V3, Incorrect Secret | 3 | ||
Proof V4, Incorrect Secret | 4 | ||
Proof V1, Mismatched Padlock | 1 | ||
Proof V2, Mismatched Padlock | 2 | ||
Proof V3, Mismatched Padlock | 3 | ||
Proof V4, Mismatched Padlock | 4 |
Optional tests are also described in optional.json, which could be used for code generation.
The suite runner is a tool that reads a provided test suite and verifies it against the public API of the implementation under test. The suite runner must use the TAP format (version 14). The runner should accept multiple suite files and merge them into a single set of tests for output.
The runner must be able to switch between normal and strict execution, and should be able to enable diagnostic output. The default run mode should be normal, non-diagnostic output.
Each suite run must output a TAP comment indicating the name and version of the runner implementation and the name and version of the implementation that created the suite. It should look something like this:
# app_identity for Elixir 1.0.0 (spec 4) testing app_identity for Ruby 1.0.0 (spec 4)
Each test specifies a major specification version indicating the conformance
requirement. If a test specification version exceeds that supported by the
implementation under test, it must be skipped as ok
with a message:
TAP Version 14
1..76
# app_identity for Elixir 1.0.0 (spec 4) testing app_identity for Ruby 1.0.0 (spec 5)
ok 1 - App V1, Proof V1
ok 2 - App V1, Proof V2
ok 3 - App V1, Proof V3
…
ok 75 - Proof V4, Mismatched Padlock
ok 76 - Basic Proof V5 Support # SKIP unsupported spec version (4 < 5)
In normal mode, optional tests that fail will be flagged as TODO
. The test
runner must return a failure status code if any required tests fail, but
must not do so if any optional tests fail.
TAP Version 14
1..75
# app_identity for Elixir 1.0.0 (spec 4) testing app_identity for Ruby 1.0.0 (spec 4)
ok 1 - App V1, Proof V1
ok 2 - App V1, Proof V2
not ok 3 - App V1, Proof V3
…
not ok 75 - Proof V4, Mismatched Padlock # TODO optional failing test
In strict mode, all tests are considered required, unless there is a spec version mismatch. If any test fails, the runner must return a failure status code.
TAP Version 14
1..75
# app_identity for Elixir 1.0.0 (spec 4) testing app_identity for Ruby 1.0.0 (spec 4)
ok 1 - App V1, Proof V1
ok 2 - App V1, Proof V2
not ok 3 - App V1, Proof V3
…
not ok 75 - Proof V4, Mismatched Padlock
If diagnostic output can be enabled, it must use the YAML diagnostic format to output the details of the failure. This may require the use of non-public APIs in the implementation.
TAP Version 14
1..75
# app_identity for Elixir 1.0.0 (spec 4) testing app_identity for Ruby 1.0.0 (spec 4)
ok 1 - App V1, Proof V1
ok 2 - App V1, Proof V2
ok 3 - App V1, Proof V3
…
not ok 75 - Proof V4, Mismatched Padlock
---
message: padlock does not match
...
There is tooling present that assists with the definition and development of the integration suite JSON schema and the definition of the integration suites.
There is a Justfile
to convert the support files from their canonical formats
to the commonly-used support formats.
Integration suites are defined by a JSON schema, schema.json
.
This file is generated from shema.ts
using
ts-json-schema-generator
.
If a suite specification modification is required, the shema.ts
should be modified directly and the JSON schema should be generated with
make schema.json
.
The tests that should be generated by a suite generator are
described by the files required.yaml
and
optional.yaml
. The test descriptions are used by a suite
generator to generate a test suite. The format is similar to the suite
format, but have some differences used by the
generators.
A simplified test definition looks like this:
type TestDefinition = {
description: string
spec_version: number
expect: 'pass' | 'fail'
app: {
version: number
config?: { fuzz: number }
}
proof: {
id?: string
secret?: string
version: number
}
padlock?: {
nonce?: string
case?: 'random' | 'upper' | 'lower' = 'random'
} | {
value: string
}
nonce?: {
value: string
} | {
offset_minutes: number
} | {
empty: true
}
}
Most values in TestDefinition
can be copied directly to the suite test, but
others values are provided so that an alternative nonce, padlock, or proof can
be generated so that suites may test potentially non-compliant values (negative
testing).
-
description
,spec_version
,expect
, andapp
are copied directly to the suite test. -
proof
is required and requiresversion
so that version upgrade is tested (that is, that a version 1 app can verify version 2, 3, or 4 proofs.)proof
may haveid
and/orsecret
values so that negative tests around incorrectid
values or changedsecret
values may be generated.
-
padlock
is optional. If unspecified, the character case of padlock strings will be randomly determined to be either uppercase or lowercase. If specified, it must be one of two shapes:-
{ nonce?: string; case?: 'random' | 'upper' | 'lower' }
:-
nonce
here specifies an explicit nonce to use, usually for negative tests. -
case
forces the character case of the padlock toupper
orlower
. -
Either is optional, but at least one must be specified.
-
padlock.case
should be ignored if AppIdentity internal functionality is used to build the proof (that is, there is no proof, nonce, or padlock customization).
-
-
{ value: string }
, where thevalue
is a static value that will be used as the padlock, primarily for negative tests. It will not have its case transformed.
-
-
nonce
is optional and if specified, must be one of three shapes:-
{ value: string }
, where thevalue
is a static nonce to be used for computation. -
{ empty: true }
, indicating that an empty nonce is required for this test. -
{ offset_minutes: integer }
, with an integer number of minutes that will be used for computing the offset in order to test timestamp fuzzing.
-
When implementing an integration suite generator, it is recommended that you study one or more of the reference implementation generators:
- Elixir:
support/app_identity/suite/generator.ex
- Ruby:
lib/app_identity/suite/generator.rb
- Typescript:
packages/suite/src/generator.ts
For condensed TAP output, we have included Eric S. Raymond's tapview
version
1.12 in the integration directory.
tapview - a TAP (Test Anything Protocol) viewer in pure POSIX shell
This code is intended to be embedded in your project. The author grants permission for it to be distributed under the prevailing license of your project if you choose, provided that license is OSD-compliant; otherwise the following SPDX tag incorporates the MIT No Attribution license by reference.
SPDX-FileCopyrightText: (C) Eric S. Raymond esr@thyrsus.com SPDX-License-Identifier: MIT-0
This version shipped on 2024-02-05.
A newer version may be available at https://gitlab.com/esr/tapview; check the ship date against the commit list there to see if it might be a good idea to update.