Glossary
AWS Glossary
For terms/items from Amazon/AWS not mentioned in this glossary, please refer to the AWS Glossary.
Cumulus Glossary of Terms
API Gateway
Refers to AWS's API Gateway. Used by the Cumulus API.
ARN
Refers to an AWS "Amazon Resource Name".
For more info, see the AWS documentation.
AWS
See: Amazon Web Services documentation.
AWS Lambda/Lambda Function
AWS's 'serverless' option. Allows the running of code without provisioning a service or managing server/ECS instances/etc.
For more information, see the AWS Lambda documentation.
AWS Access Keys
Access credentials that give you access to AWS to act as a IAM user programmatically or from the command line.
For more information, see the AWS IAM Documentation.
Bucket
An Amazon S3 cloud storage resource.
For more information, see the AWS Bucket Documentation.
CloudFormation
An AWS service that allows you to define and manage cloud resources as a preconfigured block.
For more information, see the AWS CloudFormation User Guide.
Cloudformation Template
A template that defines an AWS Cloud Formation.
For more information, see the AWS intro page.
Cloudwatch
AWS service that allows logging and metrics collections on various cloud resources you have in AWS.
For more information, see the AWS User Guide.
Cloud Notification Mechanism (CNM)
An interface mechanism to support cloud-based ingest messaging. For more information, see PO.DAAC's CNM Schema.
Common Metadata Repository (CMR)
"A high-performance, high-quality, continuously evolving metadata system that catalogs Earth Science data and associated service metadata records". For more information, see NASA's CMR page.
Collection (Cumulus)
Cumulus Collections are logical sets of data objects of the same data type and version.
For more information, see Collections - Data Management Types.
Cumulus Message Adapter (CMA)
A library designed to help task developers integrate step function tasks into a Cumulus workflow by adapting task input/output into the Cumulus Message format.
For more information, see CMA workflow reference page.
Distributed Active Archive Center (DAAC)
Refers to a specific organization that's part of NASA's distributed system of archive centers. For more information see EOSDIS's DAAC page.
Dead Letter Queue (DLQ)
This refers to Amazon SQS Dead-Letter Queues - these SQS queues are specifically configured to capture failed messages from other services/SQS queues/etc to allow for processing of failed messages.
For more on DLQs, see the Amazon Documentation and the Cumulus DLQ feature page.
Developer
Those who setup deployment and workflow management for Cumulus. Sometimes referred to as an integrator. See integrator.
ECS
Amazon's Elastic Container Service. Used in Cumulus by workflow steps that require more flexibility than Lambda can provide.
For more information, see AWS's developer guide.
ECS Activity
An ECS instance run via a Step Function.
Execution (Cumulus)
A Cumulus execution refers to a single execution of a (Cumulus) Workflow.
GIBS
Global Imagery Browse Services
Granule
A granule is the smallest aggregation of data that can be independently managed (described, inventoried, and retrieved). Granules are always associated with a collection, which is a grouping of granules. A granule is a grouping of data files.
IAM
AWS Identity and Access Management.
For more information, see AWS IAMs.
Integrator/Developer
Those who work within Cumulus and AWS for deployments and to manage workflows.
Kinesis
Amazon's platform for streaming data on AWS.
See AWS Kinesis for more information.
Lambda
AWS's cloud service that lets you run code without provisioning or managing servers.
For more information, see AWS's lambda page.
Module (Terraform)
Refers to a terraform module.
Node
See node.js.
Node Package Manager (npm)
Node package manager. Often referred to as npm.
For more information, see npm.
Operator
Those who work within Cumulus to ingest/archive data and manage collections.
PDR
"Polling Delivery Mechanism" used in "DAAC Ingest" workflows.
For more information, see nasa.gov.
Packages (npm)
Npm hosted node.js packages. Cumulus packages can be found on npm's site here
Provider
Data source that generates and/or distributes data for Cumulus workflows to act upon.
For more information, see the Cumulus documentation.
Rule
Rules are configurable scheduled events that trigger workflows based on various criteria.
For more information, see the Cumulus Rules documentation.
S3
Amazon's Simple Storage Service provides data object storage in the cloud. Used in Cumulus to store configuration, data, and more.
For more information, see AWS's S3 page.
SIPS
Science Investigator-led Processing Systems. In the context of DAAC ingest, this refers to data producers/providers.
For more information, see nasa.gov.
SNS
Amazon's Simple Notification Service provides a messaging service that allows publication of and subscription to events. Used in Cumulus to trigger workflow events, track event failures, and others.
For more information, see AWS's SNS page.
SQS
Amazon's Simple Queue Service.
For more information, see AWS's SQS page.
Stack
A collection of AWS resources you can manage as a single unit.
In the context of Cumulus, this refers to a deployment of the cumulus
and data-persistence
modules that is managed by Terraform.
Step Function
AWS's web service that allows you to compose complex workflows as a state machine comprised of tasks (Lambdas, activities hosted on EC2/ECS, some AWS service APIs, etc). See AWS's Step Function Documentation for more information. In the context of Cumulus these are the underlying AWS service used to create Workflows.
Terraform
Terraform is the tool that you will use for deployment and configuration of your Cumulus environment.
- For more information on Terraform and how it works, visit Introduction to Terraform.
- For steps on how to use Terraform for Cumulus, see How to Deploy Cumulus.
Workflows
Workflows are comprised of one or more AWS Lambda Functions and ECS Activities to discover, ingest, process, manage and archive data.