ORCA API Reference

Overview

The purpose of this page is to give developers information on how to use the ORCA API and explain the expected inputs, outputs and paths. The API can be used to get metadata information about a granule, recovery job or to get information on internal reconciliation reports and accepts and responds with JSON payloads at various HTTPS endpoints. All ORCA APIs use the POST method. All API endpoints use AWS IAM authorization.

When using the Load Balancer endpoint for GraphQL, encode your query as a string, and store it in the request body under a query key. For example, if using the Python requests library, the command would look like this:

r = requests.post(f"http://{orca_graphql_load_balancer_dns_name}:5000/graphql/", json={"query": query}, headers={})

Query examples will be shown in the appropriate sections.

warning

If an Aurora Serverless database is used for data-storage, and it has not been accessed in some time, then it may take 30-40 seconds for the database to become available. As AWS limits API Gateway invocations to 30 seconds, with no option of increase, this can cause API invocations to result in HTTP error code 504. Where available, make use of the GraphQL Load Balancer endpoints. Otherwise, include appropriate handling/retry code.

Catalog reporting API

The catalog/reconcile API call provides a user with the current listing of the ORCA catalog that can be used to reconcile granule and file information against a master catalog. For example, comparing the Cumulus primary archive holdings against the ORCA holdings to find discrepancies. Catalog reporting API input invoke URL example: https://example.execute-api.us-west-2.amazonaws.com/orca/catalog/reconcile

Catalog reporting API input

An example of the API input body is shown below:

{
  "pageIndex": 0,
  "providerId": ["lpdaac"],
  "collectionId": ["MOD14A1__061"],
  "granuleId": ["MOD14A1.061.A23V45.2020235"],
  "startTimestamp": "628021800000",
  "endTimestamp": "628021900000"
}

The following table lists the fields in the input:

Name	Data Type	Description	Required
pageIndex	`int`	The 0-based index of the results page to return.	Yes
endTimestamp	`int`	Cumulus granule createdAt end-time for date range to compare data, in milliseconds since 1 January 1970 UTC.	Yes
providerId	`Array[str]`	The unique ID of the provider making the request.	No
collectionId	`Array[str]`	The unique ID of collection to compare.	No
granuleId	`Array[str]`	The unique ID of granule to compare.	No
startTimestamp	`int`	Cumulus granule createdAt start time for date range to compare data, in milliseconds since 1 January 1970 UTC.	No

Catalog reporting API output

An example of the API output is shown below:

{
  "anotherPage": false,
  "granules": [
    {
      "providerId": "lpdaac",
      "collectionId": "MOD14A1___061",
      "id": "MOD14A1.061.A23V45.2020235",
      "createdAt": "628021850000",
      "executionId": "u654-123-Yx679",
      "ingestDate": "628021950000",
      "lastUpdate": "628021970000",
      "files": [
        {
          "name": "MOD14A1.061.A23V45.2020235.2020240145621.hdf",
          "cumulusArchiveLocation": "cumulus-bucket",
          "orcaArchiveLocation": "orca-archive",
          "keyPath": "MOD14A1/061/032/MOD14A1.061.A23V45.2020235.2020240145621.hdf",
          "sizeBytes": 100934568723,
          "hash": "ACFH325128030192834127347",
          "hashType": "SHA-256",
          "storageClass": "GLACIER",
          "version": "VXCDEG902"
        }
      ]
    }
  ]
}

The following table lists the fields in the output:

Name	Data Type	Description
anotherPage	`Boolean`	Indicates if more results can be retrieved on another page.
granules	`Array[Object]`	A list of objects representing individual files to copy.
providerId	`int`	The unique ID of the provider making the request.
collectionId	`str`	The unique ID of collection to compare.
id	`str`	The unique ID of the granule.
createdAt	`int`	The time, in milliseconds since 1 January 1970 UTC, data was originally ingested into cumulus.
executionId	`str`	Step function execution ID from AWS.
ingestDate	`int`	The time, in milliseconds since 1 January 1970 UTC, that the data was originally ingested into ORCA.
lastUpdate	`int`	The time, in milliseconds since 1 January 1970 UTC, that information was updated.
files	`Array[Object]`	Description and status of the files within the given granule.
name	`str`	The name and extension of the file.
cumulusArchiveLocation	`str`	Cumulus bucket the file resides in.
orcaArchiveLocation	`str`	Archive bucket the file resides in.
keyPath	`str`	S3 path to the file including the file name and extension, but not the bucket.
sizeBytes	`str`	Size in bytes of the file. From Cumulus ingest.
hash	`str`	Checksum hash of the file provided by Cumulus.
hashType	`str`	Hash type used to calculate the hash value of the file.
storageClass	`str`	The class of storage containing the file.
version	`str`	AWS provided version of the file.

The API returns status code 200 on success, 400 if pageIndex or endTimestamp is missing and 500 if an error occurs when querying the database.

Recovery granules API

The recovery/granules API call relates to an ORCA recovery job status and returns detailed status of the granule.

Recovery granules API input invoke URL example: https://example.execute-api.us-west-2.amazonaws.com/orca/recovery/granules

Recovery granules API input

An example of the API input body is shown below:

{
  "collectionId": "collectionName___001",
  "granuleId": "MOD14A1.061.H5V12.2020312.141531789",
  "asyncOperationId": "43c9751b-9498-4733-90d8-56b1458e0f85"
}

The following table lists the fields in the input:

Name	Data Type	Description	Required
collectionId	`str`	The ID of the collection containing the granule.	Yes
granuleId	`str`	The ID of the granule to retrieve status for.	Yes
asyncOperationId	`str`	The unique ID of the asyncOperation. May apply to a request that covers multiple granules.	No

Recovery granules API output

An example of the API output is shown below:

{
  "collectionId": "collectionName___001",
  "granuleId": "MOD14A1.061.H5V12.2020312.141531789",
  "asyncOperationId": "43c9751b-9498-4733-90d8-56b1458e0f85",
  "files": [
    {
      "fileName": "f1.doc",
      "status": "pending"
    },
    {
      "fileName": "f2.pdf",
      "status": "error",
      "error_message": "Access Denied"
    },
    {
      "fileName": "f3.txt",
      "status": "success"
    }
  ],
  "restoreDestination": "bucket_name",
  "requestTime": 628021800000,
  "completionTime": 628021900000
}

The following table lists the fields in the output:

Name	Data Type	Description
collectionId	`str`	The ID of the collection containing the granule retrieved.
granuleId	`str`	The ID of the granule retrieved.
asyncOperationId	`str`	The unique ID of the asyncOperation.
files	`Array[Object]`	Description and status of the files within the given granule.
fileName	`str`	The name and extension of the file.
status	`str`	The status of the restoration of the file. May be 'pending', 'staged', 'success', or 'error'.
errorMessage	`str`	If the restoration of the file showed error, the error will be stored here.
restoreDestination	`str`	The name of the archive bucket the granule is being copied to.
requestTime	`int`	The time, in milliseconds since 1 January 1970 UTC, when the request to restore the granule was initiated.
completionTime	`int`	The time, in milliseconds since 1 January 1970 UTC, when all granule_files were in an end state.

The API returns status code 200 on success, 400 if input is in incorrect format, 500 if an error occurs when querying the database and 404 if not found.

Recovery jobs API

The recovery/jobs API call returns detailed status for a particular recovery job. Recovery job API input invoke URL example: https://example.execute-api.us-west-2.amazonaws.com/orca/recovery/jobs

Recovery jobs API input

An example of the API input body is shown below:

{
  "asyncOperationId": "43c9751b-9498-4733-90d8-56b1458e0f85"
}

The following table lists the fields in the input:

Name	Data Type	Description	Required
asyncOperationId	`str`	The unique ID of the asyncOperation of the recovery job.	Yes

Recovery jobs API output

An example of the API output is shown below:

{
  "asyncOperationId": "43c9751b-9498-4733-90d8-56b1458e0f85",
  "jobStatusTotals": {
    "pending": 1,
    "success": 1,
    "error": 0,
    "staged": 0
  },
  "granules": [
    {
      "collectionId": "collectionName___001",
      "granuleId": "6c8d0c8b-4f9a-4d87-ab7c-480b185a0250",
      "status": "error"
    },
    {
      "collectionId": "collectionName___001",
      "granuleId": "b5681dc1-48ba-4dc3-877d-1b5ad97e8276",
      "status": "pending"
    }
  ]
}

The following table lists the fields in the output:

Name	Data Type	Description
asyncOperationId	`str`	The unique ID of the asyncOperation.
jobStatusTotals	`Object`	Sum of how many granules are in each particular restoration status ('pending', 'staged', 'success', or 'error').
granules	`Array[Object]`	An array representing each granule being copied as part of the job.
collectionId	`str`	The id of the collection containing the granule.
granuleId	`str`	The id of the granule.
status	`str`	The status of the restoration of the granule. May be 'pending', 'staged', 'success', or 'error'.

The API returns status code 200 on success, 400 if input is in incorrect format, 500 if an error occurs when querying the database, and 404 if not found.

Internal Reconcile report jobs API

The orca/datamanagement/reconciliation/internal/jobs API call receives page index from end user and returns available internal reconciliation jobs from the Orca database. Internal reconcile report jobs API input invoke URL example: https://example.execute-api.us-west-2.amazonaws.com/orca/datamanagement/reconciliation/internal/jobs

Internal Reconcile report jobs API input

An example of the API input body is shown below:

{
  "pageIndex": 0
}

The following table lists the fields in the input:

Name	Data Type	Description	Required
pageIndex	`int`	The 0-based index of the results page to return.	Yes

Internal Reconcile report jobs API output

An example of the API output is shown below:

{
  "anotherPage": false,
  "jobs": [
    {
      "id": 826,
      "orcaArchiveLocation": "PREFIX-orca-primary",
      "status": "success",
      "inventoryCreationTime": 1652227200000,
      "lastUpdate": 1652299312334,
      "errorMessage": null,
      "reportTotals": {
        "orphan": 0,
        "phantom": 1,
        "catalogMismatch": 1
      }
    },
    {
      "id": 793,
      "orcaArchiveLocation": "doctest-orca-primary",
      "status": "error",
      "inventoryCreationTime": 1652140800000,
      "lastUpdate": 1652198623479,
      "errorMessage": "Error while posting mismatches to database.",
      "reportTotals": {
        "orphan": 2,
        "phantom": 1,
        "catalogMismatch": 0
      }
    }
  ]
}

The following table lists the fields in the output:

Name	Data Type	Description
anotherPage	`bool`	Indicates if more results can be retrieved on another page.
jobs	`Array[Object]`	The jobs on the page.
id	`int`	The unique ID of the reconciliation job.
orcaArchiveLocation	`str`	Archive bucket the reconciliation targets.
status	`str`	Current status of the job. `getting S3 list`, `staged`, `generating reports`, `error`, or `success`
inventoryCreationTime	`int`	The time, in milliseconds since 1 January 1970 UTC, of inventory report initiation time from the s3 manifest.
lastUpdate	`int`	The time, in milliseconds since 1 January 1970 UTC, when status was last updated.
errorMessage	`str` or `null`	Critical error the job ran into that prevented it from finishing.
reportTotals	`Object`	The number of error reports of each type.
orphan	`int`	Number of files that have records in the S3 archive bucket but are missing in the ORCA catalog.
phantom	`int`	Number of files that have records in the ORCA catalog but are missing from S3 bucket.
catalogMismatch	`int`	Number of files exist in the ORCA S3 bucket but have different metadata values in the ORCA catalog.

The API returns status code 200 on success, 400 if jobId or pageIndex are missing and 500 if an error occurs.

Internal Reconcile report orphan API

The orca/datamanagement/reconciliation/internal/jobs/job/{jobid}/orphans API call receives job id and page index from end user and returns reporting information of files that have records in the S3 archive bucket but are missing in the ORCA catalog from the internal reconciliation job. Internal reconcile report orphan API input invoke URL example: https://example.execute-api.us-west-2.amazonaws.com/orca/datamanagement/reconciliation/internal/jobs/job/{jobid}/orphans

Internal Reconcile report orphan API input

An example of the API input body is shown below:

{
  "jobId": 123,
  "pageIndex": 0
}

The following table lists the fields in the input:

Name	Data Type	Description	Required
jobId	`int`	The unique job ID of the reconciliation job.	Yes
pageIndex	`int`	The 0-based index of the results page to return.	Yes

Internal Reconcile report orphan API output

An example of the API output is shown below:

{
  "jobId": 123,
  "anotherPage": false,
  "orphans": [
    {
      "keyPath": "MOD09GQ/006/MOD09GQ.A2017025.h21v00.006.2017034065109.hdf",
      "s3Etag": "d41d8cd98f00b204e9800998ecf8427",
      "s3FileLastUpdate": 1654878716000,
      "s3SizeInBytes": 6543277389,
      "s3StorageClass": "GLACIER"
    }
  ]
}

The following table lists the fields in the output:

Name	Data Type	Description
jobId	`str`	The unique ID of the reconciliation job.
anotherPage	`Boolean`	Indicates if more results can be retrieved on another page.
orphans	`Array[Object]`	An array representing each orphan if available.
keyPath	`str`	Key path and filename of the object in S3 bucket.
s3Etag	`str`	ETag of the object in S3 bucket.
s3FileLastUpdate	`int`	The time, in milliseconds since 1 January 1970 UTC, of last update of the object in S3 bucket.
s3SizeInBytes	`int`	Size in bytes of the object in S3 bucket.
s3StorageClass	`str`	AWS storage class the object is in the S3 bucket.

The API returns status code 200 on success, 400 if jobId or pageIndex are missing and 500 if an error occurs.

Internal Reconcile report phantom API

The getPhantomPage query receives job id and page parameters and returns reporting information of files that have records in the ORCA catalog but are missing from S3 bucket.

Internal Reconcile report phantom query example

query MyQuery {
  getPhantomPage(jobId: 2443, pageParameters: {
    limit: 2,
    direction: next,
    cursor: null
  }) {
    ... on PhantomPage {
      startCursor
      endCursor
      items {
        jobId
        collectionId
        granuleId
        filename
        keyPath
        orcaEtag
        orcaGranuleLastUpdate
        orcaSizeInBytes
        orcaStorageClass
      }
    }
    ... on ErrorGraphqlTypeInterface {
      __typename
      message
    }
    ... on InternalServerErrorGraphqlType {
      __typename
      exceptionMessage
      message
      stackTrace
    }
  }
}

The following table lists the fields in the input:

Name	Data Type	Description	Required
jobId	`int8`	The unique job ID of the reconciliation job.	Yes
pageParameters	`dict`	Contains paging information.	No

Internal Reconcile report phantom API output

An example of the API output is shown below:

{
  "data": {
    "getPhantomPage": {
      "startCursor": "eyJqb2JfaWQiOiAyNDQzLjAsICJjb2xsZWN0aW9uX2lkIjogImludGVncmF0aW9uQ29sbGVjdGlvbk5hbWVfX19pbnRlZ3JhdGlvbkNvbGxlY3Rpb25WZXJzaW9uIiwgImdyYW51bGVfaWQiOiAiaW50ZWdyYXRpb25DdW11bHVzR3JhbnVsZUlkIiwgImtleV9wYXRoIjogIk1PRDA5R1EvMDA2L01PRDA5R1EuQTIwMTcwMjUuaDIxdjAwLjAwNi4yMDE3MDM0MDY1MTA1X25kdmkuanBnIn0=",
      "endCursor": "eyJqb2JfaWQiOiAyNDQzLjAsICJjb2xsZWN0aW9uX2lkIjogImludGVncmF0aW9uQ29sbGVjdGlvbk5hbWVfX19pbnRlZ3JhdGlvbkNvbGxlY3Rpb25WZXJzaW9uIiwgImdyYW51bGVfaWQiOiAiaW50ZWdyYXRpb25DdW11bHVzR3JhbnVsZUlkIiwgImtleV9wYXRoIjogIk1PRDA5R1EvMDA2L01PRDA5R1EuQTIwMTcwMjUuaDIxdjAwLjAwNi4yMDE3MDM0MDY1MTA1X25kdmkuanBnIn0=",
      "items": [
        {
          "collectionId": "CollectionName",
          "filename": "MOD09GQ.A2017025.h21v00.006.2017034065105_ndvi.jpg",
          "jobId": 2443,
          "granuleId": "CumulusGranuleId",
          "keyPath": "MOD09GQ/006/MOD09GQ.A2017025.h21v00.006.2017034065105_ndvi.jpg",
          "orcaEtag": "\"81f4b6c158d25f1fe916ea52e99d1700\"",
          "orcaSizeInBytes": 6,
          "orcaStorageClass": "GLACIER",
          "orcaGranuleLastUpdate": 1672417036578
        }
      ]
    }
  }
}

The following table lists the fields in the output:

Name	Data Type	Description
startCursor	`str`	Cursor value for paging.
endCursor	`str`	Cursor value for paging.
items	`Array[Object]`	An array representing each phantom if available.
jobId	`int8`	The unique ID of the reconciliation job.
collectionId	`str`	Cumulus Collection ID value from the ORCA catalog.
granuleId	`str`	Cumulus granuleID value from the ORCA catalog.
filename	`str`	Filename of the object from the ORCA catalog.
keyPath	`str`	key path and filename of the object in the ORCA catalog.
orcaEtag	`str`	etag of the object as reported in the ORCA catalog.
orcaGranuleLastUpdate	`int8`	The time, in milliseconds since 1 January 1970 UTC, of last update of the object as reported in the ORCA catalog.
orcaSizeInBytes	`int8`	Size in bytes of the object as reported in the ORCA catalog.
orcaStorageClass	`str`	AWS storage class the object is in the Orca catalog.

If an error occurs, error fields will be returned instead. Possible errors: InternalServerErrorGraphqlType

Internal Reconcile report phantom API (API Gateway, Deprecated)

The orca/datamanagement/reconciliation/internal/jobs/job/{jobid}/phantoms API call receives job id and page index from end user and returns reporting information of files that have records in the ORCA catalog but are missing from S3 bucket. Internal reconcile report phantom API input invoke URL example: https://example.execute-api.us-west-2.amazonaws.com/orca/datamanagement/reconciliation/internal/jobs/job/{jobid}/phantoms

Internal Reconcile report phantom API input

An example of the API input body is shown below:

{
  "jobId": 123,
  "pageIndex": 0
}

The following table lists the fields in the input:

Name	Data Type	Description	Required
jobId	`int`	The unique job ID of the reconciliation job.	Yes
pageIndex	`int`	The 0-based index of the results page to return.	Yes

Internal Reconcile report phantom API output

An example of the API output is shown below:

{
  "jobId": 123,
  "anotherPage": false,
  "phantoms": [
    {
      "collectionId": "MOD09GQ___061",
      "granuleId": "MOD09GQ.A2017025.h21v00.006.2017034065109",
      "filename": "MOD09GQ.A2017025.h21v00.006.2017034065109.hdf",
      "keyPath": "MOD09GQ/006/MOD09GQ.A2017025.h21v00.006.2017034065109.hdf",
      "orcaEtag": "d41d8cd98f00b204e9800998ecf8427",
      "orcaGranuleLastUpdate": 1654878715868,
      "orcaSizeInBytes": 6543277389,
      "orcaStorageClass": "GLACIER"
    }
  ]
}

The following table lists the fields in the output:

Name	Data Type	Description
jobId	`str`	The unique ID of the reconciliation job.
anotherPage	`Boolean`	Indicates if more results can be retrieved on another page.
phantoms	`Array[Object]`	An array representing each phantom if available.
collectionId	`str`	Cumulus Collection ID value from the ORCA catalog.
granuleId	`str`	Cumulus granuleID value from the ORCA catalog.
filename	`str`	Filename of the object from the ORCA catalog.
keyPath	`str`	key path and filename of the object in the ORCA catalog.
orcaEtag	`str`	ETag of the object as reported in the ORCA catalog.
orcaGranuleLastUpdate	`int`	The time, in milliseconds since 1 January 1970 UTC, of last update of the object as reported in the ORCA catalog.
orcaSizeInBytes	`int`	Size in bytes of the object as reported in the ORCA catalog.
orcaStorageClass	`str`	AWS storage class the object is in the Orca catalog.

The API returns status code 200 on success, 400 if jobId or pageIndex are missing and 500 if an error occurs.

Internal Reconcile report mismatch API

The getMismatchPage query receives job id and page parameters and returns reporting information of files that exist in the ORCA S3 bucket but have different metadata values in the ORCA catalog.

Internal Reconcile report mismatch query example

query MyQuery {
  getMismatchPage(jobId: 2443, pageParameters: {
    limit: 2,
    direction: next,
    cursor: null
  }) {
    ... on MismatchPage {
      startCursor
      endCursor
      items {
        jobId
        collectionId
        granuleId
        filename
        keyPath
        cumulusArchiveLocation
        orcaEtag
        s3Etag
        orcaGranuleLastUpdate
        s3FileLastUpdate
        orcaSizeInBytes
        s3SizeInBytes
        orcaStorageClass
        s3StorageClass
        discrepancyType
        comment
      }
    }
    ... on ErrorGraphqlTypeInterface {
      __typename
      message
    }
    ... on InternalServerErrorGraphqlType {
      __typename
      exceptionMessage
      message
      stackTrace
    }
  }
}

The following table lists the fields in the input:

Name	Data Type	Description	Required
jobId	`int8`	The unique job ID of the reconciliation job.	Yes
pageParameters	`dict`	Contains paging information.	No

Internal Reconcile report mismatch API output

An example of the API output is shown below:

{
  "data": {
    "getMismatchPage": {
      "startCursor": "eyJqb2JfaWQiOiAyNDQzLjAsICJjb2xsZWN0aW9uX2lkIjogImludGVncmF0aW9uQ29sbGVjdGlvbk5hbWVfX19pbnRlZ3JhdGlvbkNvbGxlY3Rpb25WZXJzaW9uIiwgImdyYW51bGVfaWQiOiAiaW50ZWdyYXRpb25DdW11bHVzR3JhbnVsZUlkIiwgImtleV9wYXRoIjogIk1PRDA5R1EvMDA2L01PRDA5R1EuQTIwMTcwMjUuaDIxdjAwLjAwNi4yMDE3MDM0MDY1MTA1X25kdmkuanBnIn0=",
      "endCursor": "eyJqb2JfaWQiOiAyNDQzLjAsICJjb2xsZWN0aW9uX2lkIjogImludGVncmF0aW9uQ29sbGVjdGlvbk5hbWVfX19pbnRlZ3JhdGlvbkNvbGxlY3Rpb25WZXJzaW9uIiwgImdyYW51bGVfaWQiOiAiaW50ZWdyYXRpb25DdW11bHVzR3JhbnVsZUlkIiwgImtleV9wYXRoIjogIk1PRDA5R1EvMDA2L01PRDA5R1EuQTIwMTcwMjUuaDIxdjAwLjAwNi4yMDE3MDM0MDY1MTA1X25kdmkuanBnIn0=",
      "items": [
        {
          "jobId": 2443,
          "collectionId": "MOD09GQ___061",
          "granuleId": "MOD09GQ.A2017025.h21v00.006.2017034065109",
          "filename": "MOD09GQ.A2017025.h21v00.006.2017034065109.hdf",
          "keyPath": "MOD09GQ/006/MOD09GQ.A2017025.h21v00.006.2017034065109.hdf",
          "cumulusArchiveLocation": "cumulus-public",
          "orcaEtag": "d41d8cd98f00b204e9800998ecf8427",
          "s3Etag": "1f78ve1d3f41vbhg4nbb4kjhong4x14",
          "orcaGranuleLastUpdate": 1654878715868,
          "s3FileLastUpdate": 1654878716000,
          "orcaSizeInBytes": 6543277389,
          "s3SizeInBytes": 1987618731,
          "orcaStorageClass": "GLACIER",
          "s3StorageClass": "GLACIER",
          "discrepancyType": "etag, size_in_bytes",
          "comment": null
        }
      ]
    }
  }
}

The following table lists the fields in the output:

Name	Data Type	Description
startCursor	`str`	Cursor value for paging.
endCursor	`str`	Cursor value for paging.
items	`Array[Object]`	An array representing each mismatch if available.
jobId	`int8`	The unique ID of the reconciliation job.
collectionId	`str`	Cumulus Collection ID value from the ORCA catalog.
granuleId	`str`	Cumulus granuleID value from the ORCA catalog.
filename	`str`	Filename of the object from the ORCA catalog.
keyPath	`str`	key path and filename of the object in the ORCA catalog.
orcaEtag	`str`	ETag of the object as reported in the ORCA catalog.
s3Etag	`str`	ETag of the object as reported in the S3 bucket
orcaGranuleLastUpdate	`int8`	The time, in milliseconds since 1 January 1970 UTC, of last update of the object as reported in the ORCA catalog.
s3FileLastUpdate	`int8`	The time, in milliseconds since 1 January 1970 UTC, that information was updated in the S3 bucket.
orcaSizeInBytes	`int8`	Size in bytes of the object as reported in the ORCA catalog.
s3SizeInBytes	`int8`	Size in bytes of the object as reported in the S3 bucket.
orcaStorageClass	`str`	AWS storage class the object is in the Orca catalog.
s3StorageClass	`str`	AWS storage class the object is in the S3 bucket.
discrepancyType	`str`	Type of discrepancy found during reconciliation.
comment	`str`	Any additional context for the mismatch.

If an error occurs, error fields will be returned instead. Possible errors: InternalServerErrorGraphqlType

Internal Reconcile report mismatch API (API Gateway, Deprecated)

The orca/datamanagement/reconciliation/internal/jobs/job/{jobid}/mismatches API call receives job id and page index from end user and returns reporting information of files that exist in the ORCA S3 bucket but have different metadata values in the ORCA catalog. Internal reconcile report mismatch API input invoke URL example: https://example.execute-api.us-west-2.amazonaws.com/orca/datamanagement/reconciliation/internal/jobs/job/{jobid}/mismatches

Internal Reconcile report mismatch API input

An example of the API input body is shown below:

{
  "jobId": 123,
  "pageIndex": 0
}

The following table lists the fields in the input:

Name	Data Type	Description	Required
jobId	`int`	The unique job ID of the reconciliation job.	Yes
pageIndex	`int`	The 0-based index of the results page to return.	Yes

Internal Reconcile report mismatch API output

An example of the API output is shown below:

{
  "jobId": 123,
  "anotherPage": false,
  "mismatches": [
    {
      "collectionId": "MOD09GQ___061",
      "granuleId": "MOD09GQ.A2017025.h21v00.006.2017034065109",
      "filename": "MOD09GQ.A2017025.h21v00.006.2017034065109.hdf",
      "keyPath": "MOD09GQ/006/MOD09GQ.A2017025.h21v00.006.2017034065109.hdf",
      "cumulusArchiveLocation": "cumulus-public",
      "orcaEtag": "d41d8cd98f00b204e9800998ecf8427",
      "s3Etag": "1f78ve1d3f41vbhg4nbb4kjhong4x14",
      "orcaGranuleLastUpdate": 1654878715868,
      "s3FileLastUpdate": 1654878716000,
      "orcaSizeInBytes": 6543277389,
      "s3SizeInBytes": 1987618731,
      "orcaStorageClass": "GLACIER",
      "s3StorageClass": "GLACIER",
      "discrepancyType": "etag, size_in_bytes",
      "comment": null
    }
  ]
}

The following table lists the fields in the output:

Name	Data Type	Description
jobId	`str`	The unique ID of the reconciliation job.
anotherPage	`Boolean`	Indicates if more results can be retrieved on another page.
mismatches	`Array[Object]`	An array representing each mismatch if available.
collectionId	`str`	Cumulus Collection ID value from the ORCA catalog.
granuleId	`str`	Cumulus granuleID value from the ORCA catalog.
filename	`str`	Filename of the object from the ORCA catalog.
keyPath	`str`	key path and filename of the object in the ORCA catalog.
cumulusArchiveLocation	`str`	Expected S3 bucket the object is located in Cumulus. From the ORCA catalog.
orcaEtag	`str`	ETag of the object as reported in the ORCA catalog.
s3Etag	`str`	ETag of the object as reported in the S3 bucket
orcaGranuleLastUpdate	`int`	The time, in milliseconds since 1 January 1970 UTC, of last update of the object as reported in the ORCA catalog.
s3FileLastUpdate	`int`	The time, in milliseconds since 1 January 1970 UTC, that information was updated in the S3 bucket.
orcaSizeInBytes	`int`	Size in bytes of the object as reported in the ORCA catalog.
s3SizeInBytes	`int`	Size in bytes of the object as reported in the S3 bucket.
orcaStorageClass	`str`	AWS storage class the object is in the Orca catalog.
s3StorageClass	`str`	AWS storage class the object is in the S3 bucket.
discrepancyType	`str`	Type of discrepancy found during reconciliation.
comment	`str`	Any additional context for the mismatch.

The API returns status code 200 on success, 400 if jobId or pageIndex are missing and 500 if an error occurs.

Generic Types

PageParameters

When retrieving a page, direction will start you at either the start, or end of your dataset. Use next for the start, and previous for the end.

Name	Data Type	Description	Required
limit	`int`	The 0-based index of the results page to return. Defaults to 100.	No
direction	`str`	'next' or 'previous', depending on which direction to take. Defaults to 'next'.	No
cursor	`str`	The cursor denoting the start of the page to retrieve (non-inclusive). Defaults to 'null', which will retrieve the first page.	No

startCursor and endCursor values will be provided in the page output, excepting cases where the page contains no items. To retrieve a page that you have not yet retrieved, modify your query with the endCursor value from the last page you retrieved.

GraphQL Errors

All errors returned from GraphQL will implement this interface. As such, you can count on the following properties:

Name	Data Type	Description
message	`str`	A string describing what happened to cause the error.
__typename	`str`	The type of error returned. This will match a section below.

InternalServerErrorGraphqlType

In addition to the default fields, the following fields will be returned:

Name	Data Type	Description
exceptionMessage	`str`	As this is a catch-all for unexpected errors, this will contain technical messages of unknown format.
stackTrace	`str`	Technical information for use in debugging. Contact the ORCA team with any questions.

Overview​

Catalog reporting API​

Catalog reporting API input​

Catalog reporting API output​

Recovery granules API​

Recovery granules API input​

Recovery granules API output​

Recovery jobs API​

Recovery jobs API input​

Recovery jobs API output​

Internal Reconcile report jobs API​

Internal Reconcile report jobs API input​

Internal Reconcile report jobs API output​

Internal Reconcile report orphan API​

Internal Reconcile report orphan API input​

Internal Reconcile report orphan API output​

Internal Reconcile report phantom API​

Internal Reconcile report phantom query example​

Internal Reconcile report phantom API output​

Internal Reconcile report phantom API (API Gateway, Deprecated)​

Internal Reconcile report phantom API input​

Internal Reconcile report phantom API output​

Internal Reconcile report mismatch API​

Internal Reconcile report mismatch query example​

Internal Reconcile report mismatch API output​

Internal Reconcile report mismatch API (API Gateway, Deprecated)​

Internal Reconcile report mismatch API input​

Internal Reconcile report mismatch API output​

Generic Types​

PageParameters​

GraphQL Errors​

InternalServerErrorGraphqlType​

Overview

Catalog reporting API

Catalog reporting API input

Catalog reporting API output

Recovery granules API

Recovery granules API input

Recovery granules API output

Recovery jobs API

Recovery jobs API input

Recovery jobs API output

Internal Reconcile report jobs API

Internal Reconcile report jobs API input

Internal Reconcile report jobs API output

Internal Reconcile report orphan API

Internal Reconcile report orphan API input

Internal Reconcile report orphan API output

Internal Reconcile report phantom API

Internal Reconcile report phantom query example

Internal Reconcile report phantom API output

Internal Reconcile report phantom API (API Gateway, Deprecated)

Internal Reconcile report phantom API input

Internal Reconcile report phantom API output

Internal Reconcile report mismatch API

Internal Reconcile report mismatch query example

Internal Reconcile report mismatch API output

Internal Reconcile report mismatch API (API Gateway, Deprecated)

Internal Reconcile report mismatch API input

Internal Reconcile report mismatch API output

Generic Types

PageParameters

GraphQL Errors

InternalServerErrorGraphqlType