Reference

Glossary

197 AWS & networking terms — search or filter by category.

⌕

197 / 197

PBSHPC / Batch

Portable Batch System — traditional on-premises HPC job scheduler software

SlurmHPC / Batch

Simple Linux Utility for Resource Management — popular HPC cluster job scheduler

LSFHPC / Batch

IBM Load Sharing Facility — enterprise batch workload scheduler used on-premises

SSMAWS Services

AWS Systems Manager — manage EC2 instances remotely without SSH; run commands, patch, configure

AthenaAWS Services

Amazon Athena — serverless SQL query service that reads data directly from S3, no database needed

IMDSv2Compute

Instance Metadata Service v2 — more secure way to query EC2 metadata; requires a session token before fetching data

IMDSCompute

Instance Metadata Service — endpoint at 169.254.169.254 that provides info about the running EC2 instance

STSAWS Services

AWS Security Token Service — issues temporary credentials (access key + secret + session token)

CRRAWS Services

Cross-Region Replication — automatically copies S3 objects to a bucket in another AWS region

IAMAWS Services

Identity and Access Management — controls who (users, roles, services) can do what in AWS via policies

AMIAWS Services

Amazon Machine Image — a snapshot template (OS + software) used to launch EC2 instances

EC2AWS Services

Elastic Compute Cloud — virtual servers you rent in AWS; you choose the CPU, RAM, and OS

ALBLoad Balancers

Application Load Balancer — Layer 7 load balancer; routes HTTP/HTTPS by path, host header, or query string

CloudFrontAWS Services

AWS Content Delivery Network — caches content at 400+ edge locations worldwide to reduce latency for users

CDNAWS Services

Content Delivery Network — a network of edge servers that cache and serve content close to users to cut latency

RDSAWS Services

Relational Database Service — managed SQL databases (MySQL, PostgreSQL, etc.) with automated backups and patching

SSOAWS Services

Single Sign-On — log in once to access multiple AWS accounts or applications without re-entering credentials

EC2 fleetCompute

A group of EC2 instances managed together, typically mixing On-Demand and Spot types

FleetCompute

A group of EC2 instances managed together to meet a target capacity or cost, often mixing On-Demand and Spot

Spot InstancesCompute

Spare EC2 capacity at up to 90% discount — AWS can reclaim them with a 2-minute interruption notice

Reserved InstancesCompute

1- or 3-year commitment to a specific EC2 instance config. Standard RIs: up to 72% off On-Demand. Convertible RIs: up to 66% off (can change instance family/OS). All Upfront gives max discount.

Savings PlansCompute

Flexible commitment to a consistent usage amount ($/hr) for 1-3 years; applies across EC2, Lambda, Fargate

SpotCompute

EC2 Spot Instances — spare AWS capacity at up to 90% discount; can be interrupted with 2-min notice

On-DemandCompute

EC2 On-Demand — pay per second/hour with no commitment; always available, never interrupted

SSL/TLSNetworking

Protocols that encrypt data travelling over a network (in transit)

non-transitiveNetworking

Routing only works directly — VPC Peering: if A peers B and B peers C, A still cannot reach C without its own peering

transitiveNetworking

Routing through a middle point — if A connects to hub and B connects to hub, A can reach B indirectly

Transit GatewayNetworking

Central hub that connects multiple VPCs and on-premises networks with transitive routing (A↔hub↔B)

VPC PeeringNetworking

Direct private network link between two VPCs so they communicate as if on the same network; non-transitive

VPC EndpointNetworking

Private connection from your VPC to AWS services (e.g. S3) without going through the public internet

VPCNetworking

Virtual Private Cloud — your own isolated network in AWS where you launch and control resources

Internet GatewayNetworking

Allows resources in a public subnet to send and receive traffic to/from the internet

IGWNetworking

Internet Gateway — VPC component that allows public subnet resources to reach the internet

NAT GatewayNetworking

Network Address Translation Gateway — lets private subnet instances initiate outbound internet traffic without being publicly reachable

BGPNetworking

Border Gateway Protocol — dynamic routing protocol that exchanges routes between networks; used in Direct Connect and VPN

IPSecNetworking

Internet Protocol Security — encryption suite used to secure VPN tunnels over the public internet

CIDRNetworking

Classless Inter-Domain Routing — defines an IP address range. /16 = 65 536 IPs, /24 = 256 IPs

octetNetworking

Kumpulan 8 bits. IPv4 address ada 4 octets (4 × 8 = 32 bits). Nilai setiap octet = 0–255 kerana 2⁸ = 256 kemungkinan. Contoh: 192.168.100.10 → octet 1=192, octet 2=168, octet 3=100, octet 4=10

subnetNetworking

A subdivision of a VPC. Public subnet has a route to an Internet Gateway; private subnet has no direct internet access

inboundNetworking

Traffic flowing INTO your resource (e.g. an HTTP request arriving at your web server)

outboundNetworking

Traffic flowing OUT FROM your resource (e.g. your server connecting to a database)

deep packet inspectionNetworking

Inspects the full content of network packets — not just headers — to detect malware, intrusions, or policy violations

intrusion preventionNetworking

Actively blocks detected attack patterns in network traffic in real-time before they reach the target

domain filteringNetworking

Allows or blocks traffic based on hostnames/domains (e.g. block *.malicious.com) rather than raw IP addresses

AES-256Security

Advanced Encryption Standard 256-bit — industry-standard symmetric encryption algorithm

SSLSecurity

Secure Sockets Layer — older protocol for encrypting data in transit (now replaced by TLS)

TLSSecurity

Transport Layer Security — modern encryption protocol for data in transit; successor to SSL

NACLSecurity

Network Access Control List — subnet-level firewall; stateless, evaluates every packet, supports both allow and deny rules

DDoSSecurity

Distributed Denial of Service — overwhelming a service with traffic from thousands of sources to make it unavailable

SQL injectionSecurity

Attack where malicious SQL code is slipped into input fields to manipulate or dump a database

XSSSecurity

Cross-Site Scripting — attacker injects scripts into web pages to steal cookies or hijack user sessions

WAFSecurity

Web Application Firewall — filters HTTP/HTTPS requests at Layer 7 to block SQL injection, XSS, bots, and rate-limit abuse

DRTSecurity

DDoS Response Team — AWS experts available 24/7 to help Shield Advanced customers during active attacks

Layer 7Security

Application layer in the OSI model — understands HTTP, HTTPS, DNS. WAF and ALB operate here

Layer 3Security

Network layer in the OSI model — handles IP routing. Shield Standard protects here against volumetric floods

Layer 4Security

Transport layer in the OSI model — handles TCP/UDP ports. Shield protects SYN floods and UDP reflection attacks

KMSAWS Services

AWS Key Management Service — create, store, and manage encryption keys; used for data at rest encryption

SSE-KMSAWS Services

Server-Side Encryption with KMS — objects/data are encrypted at rest automatically using keys in AWS KMS

CMKAWS Services

Customer Managed Key — a KMS key you create and control (rotation, policies, auditing), vs AWS-managed keys

WORMStorage

Write Once Read Many — data can be written exactly once and never modified or deleted after

envelope encryptionEncryption

Data encrypted with a data key; the data key is encrypted by a KMS master key. Only the encrypted data key is stored alongside data

Compliance modeEncryption

S3 Object Lock mode where NO user — not even root — can shorten or delete the retention period

Governance modeEncryption

S3 Object Lock mode where users with special IAM permissions can override retention, unlike Compliance mode

legal holdEncryption

An S3 Object Lock flag that blocks deletion indefinitely with no fixed expiry — removed only when explicitly released

retention periodEncryption

The fixed time window during which an S3 object cannot be deleted or overwritten (used with Object Lock)

Multi-AZDatabase & HA

Multi-Availability Zone — synchronous standby replica in a different AZ; automatic failover for high availability

Read ReplicaDatabase & HA

Asynchronous read-only copy of a database — offloads read queries; can exist in a different region

Availability ZoneDatabase & HA

Isolated data centre cluster within an AWS Region. Each Region has 3+ AZs for redundancy

RPODatabase & HA

Recovery Point Objective — maximum acceptable data loss in time. RPO = 1 hour means you can afford to lose 1 hour of data

RTODatabase & HA

Recovery Time Objective — maximum acceptable downtime. RTO = 4 hours means systems must be back up within 4 hours

EBSStorage

Elastic Block Store — persistent block storage (like a virtual hard drive) attached to one EC2 instance

EFSStorage

Elastic File System — managed NFS file system that can be shared across multiple EC2 instances simultaneously

EFS General PurposeStorage

EFS performance mode with the LOWEST per-operation latency. Recommended for all workloads including web serving, CMS, and data sharing. AWS explicitly recommends General Purpose over Max I/O for all file systems.

EFS Max I/OStorage

Previous-generation EFS performance mode with HIGHER per-operation latency but higher aggregate throughput. For massively parallel HPC workloads with hundreds/thousands of concurrent connections. NOT recommended when low latency is required.

EFS Bursting ThroughputStorage

EFS throughput mode that scales with storage size: baseline 50 KiB/s per GiB stored. A 25 GB file system gets only ~1.25 MiB/s baseline — insufficient for high-throughput workloads. Burst credits allow temporary higher throughput.

EFS Provisioned ThroughputStorage

EFS throughput mode where you specify the exact throughput (in MiB/s) regardless of file system size. Use when throughput requirements exceed what Bursting provides for your storage size.

EFS Elastic ThroughputStorage

Recommended EFS throughput mode that automatically scales throughput up and down based on workload. Pay per use. No need to provision or manage throughput limits.

EFS mount helperStorage

amazon-efs-utils tool that simplifies EFS mounting. Supports -o tls flag to enable encryption in transit via TLS 1.2 + AES-256. Usage: sudo mount -t efs -o tls fs-xxxx /mnt/efs

IOPSStorage

Input/Output Operations Per Second — measures storage throughput. Higher IOPS = faster reads/writes (important for databases)

instance storeCompute

Ephemeral, physically attached storage on the EC2 host. Data is LOST on instance stop, terminate, or hardware failure. Can only be configured at launch time. Faster than EBS (local NVMe) but non-persistent.

EBS-backedCompute

EC2 instance whose root volume is an EBS volume — data persists across stop/start, survives instance termination (if DeleteOnTermination=false)

instance store-backedCompute

EC2 instance whose root volume is an instance store — data is lost on stop or termination. Rare in modern workloads.

Elastic VolumesStorage

EBS feature allowing you to increase volume size, change volume type, or adjust IOPS/throughput on a live, attached volume — no downtime, no detach required. After resize, extend the OS filesystem (growpart + resize2fs on Linux).

EBS snapshotStorage

Point-in-time backup of an EBS volume stored in S3. Incremental — only changed blocks are saved after the first snapshot. Used to create new volumes or copy data across regions.

AWS BackupAWS Services

Centralized managed backup service supporting EFS, EBS, RDS, DynamoDB, S3, FSx, EC2 AMIs, and more. Provides policy-based scheduling, retention rules, cross-region/cross-account copies, restore, and compliance reporting via Backup Audit Manager.

awsvpcContainers

ECS networking mode that gives each task its own ENI and private IP — enables per-task security groups and VPC Flow Logs visibility

ENIContainers

Elastic Network Interface — virtual network card attached to an EC2 instance or ECS task; carries a private IP, security groups, and MAC address

bridgeContainers

Docker bridge networking mode for ECS — tasks share the EC2 host's network interface; does not support per-task security groups

hostContainers

ECS networking mode where tasks share the EC2 host's network namespace; port conflicts possible when running multiple copies of the same task

EBKEncryption

Ephemeral Backup Key — AES-256 key generated inside an HSM to encrypt CloudHSM cluster backup data; exists only for the duration of the backup

PBKEncryption

Persistent Backup Key — long-lived key stored in CloudHSM that wraps (encrypts) the EBK; encrypted backup is stored in S3 in the same region as the cluster

ABACIAM & Policies

Attribute-Based Access Control — IAM policy technique that uses resource tags (e.g. aws:ResourceTag/Environment) to grant or deny access dynamically without hard-coding ARNs

NotPrincipalIAM & Policies

IAM policy element that matches all principals EXCEPT those listed; used with Deny to restrict a resource to only a specified set of users/roles

CORSS3 Features

Cross-Origin Resource Sharing — browser security mechanism; an S3 CORS config specifies AllowedOrigin, AllowedMethod, and AllowedHeader to permit browser JS from a different domain to make requests

AnycastNetworking

IP routing method where multiple servers share the same IP addresses; network routes to the nearest one. Used by Global Accelerator — clients always reach the closest PoP automatically

PoPNetworking

Point of Presence — AWS edge location where Global Accelerator or CloudFront receives traffic before routing it over the AWS backbone to the origin region

DRAStorage

Data Repository Association — FSx for Lustre feature linking an S3 bucket to the file system so objects are lazily imported and processed files can be exported back to S3

Standby stateCompute

ASG lifecycle state where an instance is removed from the active pool (stops receiving traffic) without being terminated — used for in-place maintenance; returns to InService when done

cooldown periodCompute

ASG setting (default 300 s) that blocks new scaling actions after a scaling event to let the fleet stabilize before evaluating whether more scaling is needed

InServiceCompute

Normal running state for an Auto Scaling group instance — registered with the load balancer and receiving traffic

edge-to-edge routingNetworking

VPC peering limitation: gateways (IGW, NAT Gateway, VGW/VPN, Direct Connect, Gateway VPC endpoint) in one VPC cannot be used by resources in a peered VPC. Each VPC must have its own gateways.

transitive peeringNetworking

Attempting to route traffic through a middle VPC via two peering connections — NOT supported. If A↔B and B↔C, A cannot reach C through B. Use Transit Gateway for transitive routing.

delete markerS3 Features

S3 versioning concept — a DELETE on a key without specifying a version ID creates a delete marker (not actual deletion); all previous versions remain and incur storage charges. To permanently remove a version, specify its version ID.

noncurrent versionsS3 Features

In a versioned S3 bucket, all versions of an object that are not the current (latest) version. Lifecycle rules can expire noncurrent versions to reduce storage cost.

secondary VPC CIDRNetworking

An additional IPv4 CIDR block associated with an existing VPC (up to 5 total). Used to expand IP space without recreating or migrating the VPC. New subnets are created from the secondary CIDR.

SNINetworking

Server Name Indication — TLS extension where the client includes the hostname in the ClientHello message. Enables a single ALB HTTPS listener to hold multiple TLS certificates and return the correct one per domain.

OACCloudFront

Origin Access Control — CloudFront feature that restricts S3 bucket access to only the CloudFront distribution via SigV4 request signing. Supports SSE-KMS encrypted buckets (OAI does not). Bucket stays private.

OAICloudFront

Origin Access Identity — legacy CloudFront feature to restrict S3 access; replaced by OAC. Does NOT support SSE-KMS encrypted S3 buckets.

statefulArchitecture

Remembers connection state — allowed return traffic is automatically permitted without an explicit rule (like Security Groups)

statelessArchitecture

Does not track connections — every packet is evaluated independently against rules, both directions need rules (like NACLs)

VPC LinkNetworking

API Gateway feature that creates a private connection between API Gateway and a Network Load Balancer (NLB) inside a VPC. Enables private integration: API Gateway → VPC Link → NLB → backend (EC2, ECS, or on-premises via Direct Connect). No public internet traffic.

API Gateway throttlingMessaging

API Gateway rate limiting: steady-state rate (requests/sec) + burst rate (spike capacity). Protects backend from overload. Configured per stage or per method. Returns HTTP 429 Too Many Requests when exceeded.

API cachingMessaging

API Gateway can cache endpoint responses for a configurable TTL (300s default). Reduces backend calls for repeated identical requests. Cache capacity: 0.5 GB–237 GB. Supports encryption at rest.

Lambda authorizerMessaging

Custom API Gateway access control via a Lambda function. Two types: TOKEN (bearer token like JWT/OAuth) and REQUEST (uses request params like headers/query strings). Returns IAM policy allowing/denying access.

Usage PlanMessaging

API Gateway feature to control access via API keys: sets throttling limits (rate/burst) and quota (max requests/day/week/month) per API key. Used to monetize or tier API access.

SQS Long PollingMessaging

SQS ReceiveMessage waits up to 20 seconds for a message before returning. Reduces API calls and cost vs short polling (which returns immediately even if queue empty). Set ReceiveMessageWaitTimeSeconds > 0.

SQS Short PollingMessaging

Default SQS behavior — ReceiveMessage returns immediately even if no messages are available. Results in many empty responses and high API call costs for frequently polled queues.

Visibility TimeoutMessaging

Period during which SQS hides a retrieved message from other consumers (default 30s, max 12 hours). Must exceed processing time to prevent duplicate processing. Extend per-message with ChangeMessageVisibility.

SQS FIFOMessaging

SQS First-In-First-Out queue: guarantees strict message ordering within a MessageGroupId and exactly-once processing (5-min deduplication window). Max 3000 TPS with batching. Use for ordered, deduplicated processing.

Dead Letter QueueMessaging

SQS/SNS queue for messages that fail processing after max receive count. Used for debugging and isolating problematic messages. Configure via RedrivePolicy on the source queue.

SNS fan-outMessaging

Pattern where one SNS topic fans out to multiple SQS queues, Lambda functions, or HTTP endpoints simultaneously. Enables parallel processing of the same message by multiple subscribers.

Step FunctionsMessaging

AWS serverless workflow orchestration service. Coordinates Lambda functions, ECS tasks, and other services in multi-step workflows with built-in error handling, retries, branching, and parallel execution. Two types: Standard (exactly-once, up to 1 year) and Express (at-least-once, up to 5 min).

RDS Multi-AZDatabase & HA

High availability feature: synchronous standby replica in a different AZ. Failover is automatic — the endpoint CNAME is updated to point to the standby. Standby is NOT accessible for reads.

Aurora ServerlessDatabase & HA

Aurora capacity mode that automatically scales compute up/down based on demand and can pause when idle. Ideal for intermittent, unpredictable, or infrequent workloads. v2 scales in fine-grained ACU increments.

DynamoDB PITRDatabase & HA

DynamoDB Point-in-Time Recovery: enables continuous incremental backups. Restore to any second in the last 35 days. No performance impact. Provides RPO near-zero. Different from on-demand backups.

DynamoDB Auto ScalingDatabase & HA

Automatically adjusts DynamoDB provisioned read/write capacity units based on actual traffic using AWS Application Auto Scaling. Set target utilization % and min/max capacity bounds.

AWS DMSAWS Services

AWS Database Migration Service: migrates databases to AWS with minimal downtime. Supports homogeneous (MySQL→MySQL) and heterogeneous (Oracle→Aurora) migrations. CDC (Change Data Capture) mode keeps source and target synchronized during cutover.

CloudFormation DeletionPolicyDatabase & HA

Attribute on a CloudFormation resource that controls what happens when the resource is deleted: Delete (default), Retain (keep resource), Snapshot (create final snapshot — supported by RDS, EBS, ElastiCache, not S3).

Secrets ManagerAWS Services

AWS service for storing and automatically rotating secrets (database passwords, API keys). Built-in rotation for RDS/Aurora/Redshift/DocumentDB via managed Lambda rotation function. Charged per secret per month.

Aurora ReplicasDatabase & HA

Read-only replicas within an Aurora cluster. Serve read traffic with typically <10ms lag. Can be scaled automatically with Aurora Auto Scaling based on CPU/connections. Up to 15 replicas per cluster.

Amazon ComprehendML / AI

AWS managed NLP service: sentiment analysis, entity recognition, key phrase extraction, topic modeling, language detection. No ML expertise needed. Analyzes text from support tickets, social media, documents.

Amazon LexML / AI

Conversational AI service for building chatbots and voice interfaces. Provides NLU (Natural Language Understanding) + ASR (speech recognition). Powers Amazon Alexa. Manages multi-turn conversation state.

Amazon TextractML / AI

Extracts text and structured data from scanned documents (PDF, images). Goes beyond OCR: extracts key-value pairs from forms and data from tables. Used for invoice/contract/report processing.

Amazon KendraML / AI

Intelligent enterprise search service powered by ML. Indexes and searches across diverse data sources (S3, SharePoint, databases) including unstructured documents (PDF, Word, email). Natural language query understanding.

Amazon RekognitionML / AI

Image and video analysis service. Detects objects, scenes, faces, text, and explicit content. Facial recognition and comparison. NOT for document text extraction (use Textract) or NLP (use Comprehend).

Amazon PollyML / AI

Text-to-speech service: converts written text to lifelike audio. Supports multiple voices and languages. NOT for chatbots (use Lex) or text analysis (use Comprehend).

Amazon MSKAnalytics

Managed Streaming for Apache Kafka — fully managed Kafka cluster on AWS. Handles broker provisioning, patching, storage scaling. NO SSH to brokers. Lambda integration requires Event Source Mapping. MSK Serverless auto-scales capacity.

Amazon OpenSearch ServiceAnalytics

Managed Elasticsearch/OpenSearch cluster. Full-text search with relevance scoring, spell-checking, synonym support, fuzzy matching. Used for e-commerce product search, log analytics, application monitoring.

AWS Data ExchangeAnalytics

AWS marketplace for subscribing to and accessing third-party data products (market data, financial data, regulatory filings). Data delivered directly to your S3 bucket. Handles licensing and subscription management.

Amazon Kinesis Data StreamsAnalytics

Real-time data streaming service. Captures GB/s of data with sub-second latency. Retains data 1–365 days. Integrates with Lambda, Firehose, Analytics. Use for real-time dashboards, ML, and event-driven architectures.

AWS GlueAnalytics

Serverless ETL (Extract, Transform, Load) service. Crawls data sources to build a metadata catalog, runs Spark-based transformation jobs, and orchestrates data pipelines. NOT a streaming or search service.

bastion hostSecurity

EC2 in a public subnet used as the only SSH/RDP entry point into private subnet instances — connect to bastion first, then hop to private instances

jump hostSecurity

Another name for a bastion host — a hardened EC2 in a public subnet that you jump through to reach private subnet instances

Elastic IPArchitecture

Static public IPv4 address allocated to your AWS account — stays fixed until released, survives instance stop/start; NAT Gateway requires one

penetration testingSecurity

Authorised simulated attack to find vulnerabilities — AWS allows pentest on 8 services (EC2, RDS, CloudFront, Aurora, API GW, Lambda, Lightsail, Elastic Beanstalk) without prior approval

AUPSecurity

AWS Acceptable Use Policy — defines what is permitted and prohibited on AWS infrastructure, including security testing rules

Backup and RestoreArchitecture

DR strategy with highest RTO/RPO and lowest cost. Data backed up to DR region; entire infrastructure must be redeployed during disaster. Best for non-critical workloads or data protection only.

Pilot LightArchitecture

DR strategy: core data continuously replicated to DR region, minimal infrastructure pre-provisioned (switched off). Scale up only when disaster strikes. Lower RTO than Backup & Restore, higher than Warm Standby.

Warm StandbyArchitecture

DR strategy: scaled-down but fully functional copy of production running in DR region at all times. Scale up to full capacity during failover. Faster RTO than Pilot Light, costs more.

Active/ActiveArchitecture

Multi-Site DR strategy: full production workload running in multiple AWS Regions simultaneously. Traffic load-balanced across regions. Lowest RTO/RPO (near zero), highest cost. Also called Hot Standby.

IaCArchitecture

Infrastructure as Code — managing and provisioning infrastructure through machine-readable configuration files (e.g. CloudFormation, Terraform) instead of manual processes. Enables repeatable, version-controlled deployments.

Blue/Green deploymentArchitecture

Deployment strategy with two identical environments (Blue = current, Green = new). Traffic switched from Blue to Green after testing. Instant rollback by switching back. Zero downtime deployments.

Canary deploymentArchitecture

Gradual traffic shift to a new version — e.g. 5% of users get v2, 95% get v1. Monitor for errors, then increase percentage. Reduces blast radius of bad deployments.

Fan-outArchitecture

Pattern where one SNS topic delivers messages to multiple SQS queues simultaneously. Decouples producers from consumers. E.g. one S3 upload event triggers 3 different Lambda functions via SNS→SQS.

Event-drivenArchitecture

Architecture pattern where services communicate by producing and consuming events, not direct calls. Services are loosely coupled. E.g. S3 upload → EventBridge → Lambda → SQS → EC2.

IdempotencyArchitecture

Property where an operation produces the same result regardless of how many times it is called. Critical for distributed systems — if a Lambda is retried, it should not double-process an order.

ElasticityArchitecture

Ability to automatically scale resources UP during peak demand and scale DOWN when demand drops. Distinct from scalability (which just means ability to scale up). Elasticity = auto up + auto down.

Fault toleranceArchitecture

Ability of a system to continue operating correctly despite the failure of one or more components. Achieved through redundancy (Multi-AZ, Multi-Region), circuit breakers, and graceful degradation.

Shared Responsibility ModelArchitecture

AWS is responsible for security OF the cloud (physical infra, hardware, hypervisor). Customer is responsible for security IN the cloud (data, OS patches, IAM config, application security, encryption).

ServerlessArchitecture

No servers to manage — AWS handles provisioning, scaling, patching. Pay only for what you use. Examples: Lambda (compute), Fargate (containers), DynamoDB (database), Aurora Serverless (DB), S3 (storage).

MicroservicesArchitecture

Architecture where an application is broken into small, independent services each with a single responsibility. Services communicate via APIs or events. Each can be deployed, scaled, and updated independently.

ContainerizationArchitecture

Packaging an application and all its dependencies into a portable container image. Containers run consistently across environments. Docker is the standard format; ECS and EKS orchestrate containers on AWS.

Cold startCompute

Latency when Lambda creates a new execution environment from scratch (download code, initialize runtime). Adds 100ms–1s+ delay. Mitigated by Provisioned Concurrency (pre-warmed environments) or keeping functions warm.

Provisioned ConcurrencyCompute

Lambda feature that pre-initializes a specified number of execution environments, eliminating cold starts. Pay extra for pre-warmed capacity. Use for latency-sensitive production workloads.

Reserved ConcurrencyCompute

Sets the maximum concurrent executions for a specific Lambda function (throttles above this limit). Also guarantees that capacity is reserved — other functions cannot use it. Setting to 0 = disable function.

Lambda LayerCompute

A .zip archive containing libraries, runtime, or other dependencies shared across multiple Lambda functions. Reduces deployment package size. Up to 5 layers per function.

FargateCompute

Serverless compute engine for containers — runs ECS tasks or EKS pods without managing EC2 instances. AWS manages the underlying infrastructure. Pay per vCPU and memory used by each task.

EC2 Placement GroupCompute

Controls how EC2 instances are placed on physical hardware. Cluster = same rack, low latency HPC. Spread = different racks, max resilience. Partition = groups on separate partitions for large distributed apps.

Spot interruptionCompute

AWS can reclaim Spot Instances with 2-minute warning when capacity is needed. Applications must handle interruption gracefully. Use Spot for fault-tolerant, stateless, or checkpointable workloads.

ARNIAM & Policies

Amazon Resource Name — unique identifier for every AWS resource. Format: arn:partition:service:region:account-id:resource. Example: arn:aws:s3:::my-bucket. Used in IAM policies to specify exact resources.

PrincipalIAM & Policies

Entity that can make requests to AWS: IAM user, IAM role, AWS service (e.g. Lambda), federated user, or AWS account. Specified in resource-based policies to define WHO can access the resource.

Identity-based policyIAM & Policies

IAM policy attached to an IAM identity (user, group, role). Defines what actions that identity can perform on which resources. Most common policy type. Can be AWS managed, customer managed, or inline.

Resource-based policyIAM & Policies

IAM policy attached to a resource (S3 bucket, SQS queue, KMS key, Lambda). Defines who can access the resource and what they can do. Enables cross-account access without requiring role assumption.

Permissions boundaryIAM & Policies

IAM managed policy that sets the MAXIMUM permissions an IAM entity (user or role) can have. Even if identity-based policies grant more, the boundary caps it. Does not grant permissions by itself.

Managed policyIAM & Policies

Standalone IAM policy that can be attached to multiple users, groups, or roles. AWS managed = created by AWS (e.g. AdministratorAccess). Customer managed = created by you. Easier to reuse and update than inline.

Inline policyIAM & Policies

IAM policy embedded directly into one specific user, group, or role. Not reusable. Deleted when the entity is deleted. Use sparingly — managed policies are preferred for maintainability.

Trust policyIAM & Policies

Resource-based policy on an IAM role that defines which principals (services, accounts, users) can ASSUME the role. Every role has exactly one trust policy. Example: allow EC2 service to assume the role.

OUIAM & Policies

Organizational Unit — a container for AWS accounts within AWS Organizations. SCPs can be applied to OUs to restrict all accounts within. OUs can be nested. Management account is at the root.

Permission deniedIAM & Policies

When IAM evaluation results in Deny. Explicit Deny always overrides Allow. A missing Allow = implicit Deny. Order: explicit Deny → SCP limit → permissions boundary → resource policy → identity policy.

PrivateLinkNetworking

AWS technology powering Interface VPC Endpoints. Creates private connectivity to AWS services or third-party services via ENIs in your VPC. Traffic never leaves AWS network. Used by all Interface endpoints.

EgressNetworking

Traffic flowing OUT of your network or VPC to the internet or another network. Egress charges apply when data leaves AWS. NAT Gateway, Internet Gateway, and Direct Connect all handle egress traffic.

IngressNetworking

Traffic flowing INTO your network or VPC from the internet or another network. Security groups and NACLs control ingress. No AWS charge for ingress data transfer.

FQDNNetworking

Fully Qualified Domain Name — complete domain name specifying exact location in DNS hierarchy. Example: my-alb-1234567890.us-east-1.elb.amazonaws.com. Alias records in Route 53 can point to FQDNs.

LatencyNetworking

Time delay between a request being sent and the response being received. Affected by geographic distance, network congestion, processing time. Reduce with: CloudFront, Global Accelerator, multi-region deployments.

ThroughputNetworking

Amount of data transferred per unit of time (MB/s, Gbps) or records processed per second. Higher throughput = more capacity. EBS st1 optimized for throughput; EBS io2 optimized for IOPS (random I/O).

NLBLoad Balancers

Network Load Balancer — Layer 4 (TCP/UDP) load balancer. Handles millions of requests per second with ultra-low latency. Has static IP per AZ. Use for: non-HTTP traffic, static IP requirement, extreme performance needs.

CLBLoad Balancers

Classic Load Balancer — legacy (Layer 4 + Layer 7). Use ALB or NLB instead for new deployments. Still used for EC2-Classic or very old apps.

Target GroupLoad Balancers

ALB/NLB routing destination — a group of registered targets (EC2 instances, IPs, Lambda functions, other ALBs). Health checks run against each target. Listener rules route traffic to specific target groups by path/host/header.

Pre-signed URLS3 Features

Time-limited URL that grants temporary access to a private S3 object without requiring AWS credentials. Generated using AWS SDK with an expiry time (max 7 days). Used to share private files securely with external users.

S3 VersioningS3 Features

S3 feature that preserves every version of an object. When an object is overwritten or deleted, the previous version is retained. Enables recovery from accidental deletes or overwrites. Required for S3 Object Lock.

S3 Lifecycle PolicyS3 Features

Automated rules to transition S3 objects between storage classes or expire (delete) them after a defined period. Example: transition to Glacier after 30 days, delete after 1 year. Reduces storage costs automatically.

S3 Transfer AccelerationS3 Features

Speeds up S3 uploads by routing traffic through CloudFront edge locations instead of directly to S3. Data enters AWS network at the nearest edge point. Useful for large file uploads from distant locations.

SRRS3 Features

Same-Region Replication — automatically copies S3 objects between buckets in the SAME region. Use for: compliance (separate account copy), log aggregation, live replicas for test environments.

DAX

DynamoDB Accelerator — fully managed in-memory cache for DynamoDB. Microsecond read latency (vs millisecond for DynamoDB). Drop-in compatible, no application code changes. Only for READ caching.

TTL

Time To Live — DynamoDB feature that auto-deletes items after a specified timestamp. No extra cost. Items expired within ~48 hours. Useful for session data, temp records, log expiry. Does NOT consume write capacity.

DLQ

Dead Letter Queue — SQS queue that receives messages which failed processing after the maximum number of retries (maxReceiveCount). Used for debugging, manual reprocessing, alerting. Prevents poison-pill messages blocking the queue.

FIFO Queue

First-In-First-Out SQS queue. Guarantees: (1) exact ordering of messages, (2) exactly-once processing. Max 300 transactions/second (3,000 with batching). Use when order matters (e.g. financial transactions, sequential steps).