AWS Lambda

AWS Lambda offers Serverless computing that allows applications and services to be built and run without thinking about servers.

With serverless computing, the application still runs on servers, but all the server management is done by AWS.
helps run code without provisioning or managing servers, where you pay only for the compute time when the code is running.

is priced on a pay-per-use basis and there are no charges when the code is not running.
allows the running of code for any type of application or backend service with zero administration.
performs all the operational and administrative activities on your behalf, including capacity provisioning, monitoring fleet health, applying security patches to the underlying compute resources, deploying code, running a web service front end, and monitoring and logging the code.

does not provide access to the underlying compute infrastructure.
handles scalability and availability as it
- provides easy scaling and high availability to the code without additional effort on your part.
- is designed to process events within milliseconds.
- is designed to run many instances of the functions in parallel.
- is designed to use replication and redundancy to provide high availability for both the service and the functions it operates.
- has no maintenance windows or scheduled downtimes for either.
- has a default safety throttle for the number of concurrent executions per account per region (default 1,000 concurrent executions).
- scales by 1,000 concurrent executions every 10 seconds until the account’s concurrency limit is reached (12x faster scaling announced Nov 2023).
- has a higher latency immediately after a function is created, or updated, or if it has not been used recently.
- for any function updates, there is a brief window of time, less than a minute, when requests would be served by both versions
Security
- stores code in S3 and encrypts it at rest and performs additional integrity checks while the code is in use.
- each function runs in its own isolated environment, with its own resources and file system view
- supports Code Signing using AWS Signer, which offers trust and integrity controls that enable you to verify that only unaltered code from approved developers is deployed in the functions.

Functions must complete execution within 900 seconds (15 minutes). The default timeout is 3 seconds. The timeout can be set to any value between 1 and 900 seconds.
AWS Step Functions can help coordinate a series of Lambda functions in a specific order. Multiple functions can be invoked sequentially, passing the output of one to the other, and/or in parallel, while the state is being maintained by Step Functions.
AWS X-Ray helps to trace functions, which provides insights such as service overhead, function init time, and function execution time.

Lambda Provisioned Concurrency provides greater control over the performance of serverless applications.
Lambda SnapStart reduces cold start latency to sub-second for Java, Python, and .NET functions without code changes or additional cost.
Lambda Durable Functions enable multi-step applications and AI workflows with automatic checkpointing, failure recovery, and execution suspension for up to one year.

Lambda Managed Instances allow running Lambda functions on EC2 instances with serverless operational simplicity, enabling access to specialized compute and EC2 commitment-based pricing (up to 72% savings).
Lambda@Edge allows you to run code across AWS locations globally without provisioning or managing servers, responding to end-users at the lowest network latency.
Lambda Extensions allow integration of Lambda with other third-party tools for monitoring, observability, security, and governance.

Compute Savings Plan can help save money for Lambda executions.
CodePipeline and CodeDeploy can be used to automate the serverless application release process.
RDS Proxy provides a highly available database proxy that manages thousands of concurrent connections to relational databases.

Supports Elastic File Store, to provide a shared, external, persistent, scalable volume using a fully managed elastic NFS file system without the need for provisioning or capacity management.
supports Function URLs, a built-in HTTPS endpoint that can be invoked using the browser, curl, and any HTTP client.
supports Response Streaming, allowing functions to send response data to callers as it becomes available, enabling larger payloads and long-running operations with incremental progress reporting.

supports Graviton2 (ARM64) architecture, delivering up to 34% better price-performance compared to x86_64 functions.
supports configurable ephemeral storage (/tmp) between 512 MB and 10,240 MB (10 GB) for data-intensive workloads.
supports up to 10,240 MB (10 GB) of memory with up to 6 vCPUs proportionally allocated.

supports asynchronous invocation payload sizes up to 1 MB (increased from 256 KB in Oct 2025).
supports Advanced Logging Controls with native JSON structured logging for easier search, filter, and analysis of function logs.
supports Recursive Loop Detection that automatically detects and stops recursive invocations between Lambda and supported services (SQS, SNS, S3) to prevent runaway costs.

Functions & Event Sources

Core components of Lambda are functions and event sources.
- Event source – an AWS service or custom application that publishes events.
- Function – a custom code that processes the events.

Lambda Functions

Each function has associated configuration information, such as its name, description, runtime, entry point, and resource requirements

Lambda functions should be designed as stateless
- to allow launching of as many copies of the function as needed as per the demand.
- Local file system access, child processes, and similar artifacts may not extend beyond the lifetime of the request
- The state can be maintained externally in DynamoDB or S3
Lambda Execution role can be assigned to the function to grant permission to access other resources.
Functions have the following restrictions
- Inbound network connections are blocked
- Outbound connections only TCP/IP sockets are supported
- ptrace (debugging) system calls are blocked
- TCP port 25 traffic is also blocked as an anti-spam measure.
Lambda may choose to retain an instance of the function and reuse it to serve a subsequent request, rather than creating a new copy.
Lambda Layers provide a convenient way to package libraries and other dependencies that you can use with your Lambda functions.

Function versions can be used to manage the deployment of the functions.
Function Alias supports creating aliases, which are mutable, for each function version.
Functions are automatically monitored, and real-time metrics are reported through CloudWatch, including total requests, latency, error rates, etc.

Lambda automatically integrates with CloudWatch logs, creating a log group for each function and providing basic application lifecycle event log entries, including logging the resources consumed for each use of that function.
Functions support code written in
- Node.js (Node.js 22, Node.js 24)
- Python (Python 3.12, 3.13, 3.14)
- Java (Java 21, Java 25)
- C# (.NET 8, .NET 10)
- Ruby (Ruby 3.3, 3.4, 4.0)
- Go (using OS-only runtime provided.al2023)
- Rust (using OS-only runtime provided.al2023)
- Custom runtime (provided.al2023)
Container images are also supported.
Supports both x86_64 and arm64 (Graviton2) architectures for all managed runtimes.

Failure Handling
- For S3 bucket notifications and custom events, Lambda will attempt execution of the function three times in the event of an error condition in the code or if a service or resource limit is exceeded.
- For ordered event sources that Lambda polls, e.g. DynamoDB Streams and Kinesis streams, it will continue attempting execution in the event of a developer code error until the data expires.
- Kinesis and DynamoDB Streams retain data for a minimum of 24 hours
- Dead Letter Queues (SNS or SQS) can be configured for events to be placed, once the retry policy for asynchronous invocations is exceeded
- Lambda Destinations can be configured for successful and failed asynchronous invocations (recommended over DLQ).

Read in-depth @ Lambda Functions

Lambda Event Sources

Event Source is an AWS service or developer-created application that produces events that trigger an AWS Lambda function to run
Event source mapping refers to the configuration which maps an event source to a Lambda function.
Event sources can be both push and pull sources
- Services like S3, and SNS publish events to Lambda by invoking the cloud function directly.
- Lambda can also poll resources in services like Kafka, and Kinesis streams that do not publish events to Lambda.

Read in-depth @ Event Sources

Lambda Execution Environment

Lambda invokes the function in an execution environment, which provides a secure and isolated runtime environment.

Execution Context is a temporary runtime environment that initializes any external dependencies of the Lambda function code, e.g. database connections or HTTP endpoints.
When a function is invoked, the Execution environment is launched based on the provided configuration settings i.e. memory and execution time.
After a Lambda function is executed, Lambda maintains the execution environment for some time in anticipation of another function invocation which allows it to reuse the /tmp directory and objects declared outside of the function’s handler method e.g. database connection.

When a Lambda function is invoked for the first time or after it has been updated there is latency for bootstrapping as Lambda tries to reuse the Execution Context for subsequent invocations of the Lambda function
Subsequent invocations perform better performance as there is no need to “cold-start” or initialize those external dependencies
Execution environment
- takes care of provisioning and managing the resources needed to run the function.
- provides lifecycle support for the function’s runtime and any external extensions associated with the function.
Function’s runtime communicates with Lambda using the Runtime API.
Extensions communicate with Lambda using the Extensions API.
Extensions can also receive log messages from the function by subscribing to logs using the Logs API.

Lambda manages Execution Environment creations and deletion, there is no AWS Lambda API to manage Execution Environment.
Execution environments support both standard functions (up to 15 minutes) and Durable Functions (up to one year).

Lambda Execution Environment

Lambda in VPC

Lambda function always runs inside a VPC owned by the Lambda service which isn’t connected to your account’s default VPC

Lambda applies network access and security rules to this VPC and maintains and monitors the VPC automatically.
A function can be configured to be launched in private subnets in a VPC in your AWS account.
Function connected to VPC can access private resources databases, cache instances, or internal services during the execution.

To enable the function to access resources inside the private VPC, additional VPC-specific configuration information that includes private subnet IDs and security group IDs must be provided.
Lambda uses this information to set up ENIs that enables the function to connect securely to other resources within your private VPC.
Functions connected to VPC can’t access the Internet and need a NAT Gateway to access any external resources outside of AWS.

Functions cannot connect directly to a VPC with dedicated instance tenancy, instead, peer it to a second VPC with default tenancy.

Lambda Security

All data stored in ephemeral storage is encrypted at rest with a key managed by AWS.
Lambda functions provide access only to a single VPC. If multiple subnets are specified, they must all be in the same VPC. Other VPCs can be connected using VPC Peering.

Supports Code Signing using AWS Signer, which offers trust and integrity controls that enable you to verify that only unaltered code from approved developers is deployed in the functions.
AWS Lambda can perform the following signature checks at deployment:
- Corrupt signature – This occurs if the code artifact has been altered since signing.
- Mismatched signature – This occurs if the code artifact is signed by a signing profile that is not approved.
- Expired signature – This occurs if the signature is past the configured expiry date.
- Revoked signature – This occurs if the signing profile owner revokes the signing jobs.

For sensitive information, for e.g. passwords, AWS recommends using client-side encryption using AWS Key Management Service – KMS and store the resulting values as ciphertext in your environment variable.
Function code should include the logic to decrypt these values.

Lambda Permissions

IAM – Use IAM to manage access to the Lambda API and resources like functions and layers.

Execution Role – A Lambda function can be provided with an Execution Role, that grants it permission to access AWS services and resources e.g. send logs to CloudWatch and upload trace data to AWS X-Ray.
Function Policy – Resource-based Policies
- Use resource-based policies to give other accounts and AWS services permission to use the Lambda resources.
- Resource-based permissions policies are supported for functions and layers.

Invoking Lambda Functions

Lambda functions can be invoked
- directly using the Lambda console or API, a function URL HTTP(S) endpoint, an AWS SDK, the AWS CLI, and AWS toolkits.
- other AWS services like S3 and SNS invoke the function.
- to read from a stream or queue and invoke the function.
Functions can be invoked
- Synchronously
  - You wait for the function to process the event and return a response.
  - Error handling and retries need to be handled by the Client.
  - Invocation includes API, and SDK for calls from API Gateway.
  - Maximum request/response payload size is 6 MB.
- Asynchronously
  - queues the event for processing and returns a response immediately.
  - handles retries and can send invocation records to a destination for successful and failed events.
  - Invocation includes S3, SNS, and CloudWatch Events
  - can define DLQ for handling failed events. AWS recommends using destinations instead of DLQ.
  - Maximum payload size is 1 MB (increased from 256 KB in Oct 2025).

Lambda SnapStart

Lambda SnapStart reduces cold start latency from several seconds to as low as sub-second, typically with no or minimal code changes.

Supported for Java, Python, and .NET runtimes (Python and .NET became GA in November 2024).
Works by taking a snapshot of the initialized execution environment (memory and disk state) after initialization completes.
When the function is invoked, Lambda resumes the execution environment from the cached snapshot instead of initializing from scratch.

Can improve startup performance by up to 10x for Java functions.
SnapStart is an opt-in capability configured at the function level.
For SnapStart-enabled functions, initialization code can run for up to 15 minutes when creating a snapshot.

Available in most AWS Regions (expanded to 23+ additional regions in 2025).
Unlike Provisioned Concurrency, SnapStart does not incur additional charges for pre-initialized environments.

Lambda Provisioned Concurrency

Lambda Provisioned Concurrency provides greater control over the performance of serverless applications.

When enabled, Provisioned Concurrency keeps functions initialized and hyper-ready to respond in double-digit milliseconds.
Provisioned Concurrency is ideal for building latency-sensitive applications, such as web or mobile backends, synchronously invoked APIs, and interactive microservices.
The amount of concurrency can be increased during times of high demand and lowered or turn it off completely when demand decreases.

If the concurrency of a function reaches the configured level, subsequent invocations of the function have the latency and scale characteristics of regular functions.
Application Auto Scaling can be used to automatically manage provisioned concurrency based on utilization.

Lambda Durable Functions

Lambda Durable Functions (announced Dec 2025) enable multi-step applications and AI workflows with automatic checkpointing and failure recovery.

Durable functions use a checkpoint and replay mechanism (durable execution) to persist progress at specific points in code.
Key capabilities:
- Checkpoints – act as save points, persisting progress at specific moments in the execution.
- Steps – define logical units of work that are automatically checkpointed upon completion.
- Waits – allow pausing execution for up to one year without incurring compute charges (for on-demand functions).
- Automatic Failure Recovery – when a failure occurs, execution resumes from the most recent checkpoint rather than starting over.

Requires the open source Durable Execution SDK (available for Node.js and Python runtimes).
When a function resumes, it replays from the beginning but skips completed work using saved checkpoint results.
Ideal for:
- Human-in-the-loop processes
- AI and LLM orchestration workflows
- Multi-step data processing pipelines
- Long-running approval workflows

Available in 14+ AWS Regions.
Does not require managing additional infrastructure or writing custom state management code.

Lambda Managed Instances

Lambda Managed Instances (announced Nov 2025, re:Invent) allow running Lambda functions on Amazon EC2 instances while maintaining serverless operational simplicity.
AWS handles all infrastructure tasks: instance lifecycle, OS and runtime patching, routing, load balancing, and auto scaling.

Key benefits:
- EC2 Pricing Models – Access to Compute Savings Plans and Reserved Instances for up to 72% discount over On-Demand pricing.
- Specialized Compute – Access to latest-generation EC2 instances including Graviton4, network-optimized, and large-memory instances.
- Multi-concurrency – Each execution environment can handle multiple concurrent requests, improving resource utilization.
- No Cold Starts – Pre-provisioned execution environments eliminate cold start latency.
Organized into Capacity Providers that define compute characteristics (instance type, networking, scaling parameters).

Instances have a maximum 14-day lifetime for security and compliance.
Pricing: Standard Lambda request charges + EC2 instance charges + 15% compute management fee.
Supports Node.js, Java, .NET, and Python runtimes.
Existing Lambda functions can be migrated without code changes (must validate thread safety for multi-concurrency).

Available in US East (N. Virginia, Ohio), US West (Oregon), Asia Pacific (Tokyo), and Europe (Ireland).

Lambda@Edge

Read in-depth @ Lambda@Edge

Lambda Extensions

Lambda Extensions allow integration of Lambda with other third-party tools for monitoring, observability, security, and governance.
Extensions run as companion processes within the Lambda execution environment.

Internal extensions run as part of the runtime process; external extensions run as separate processes.
Extensions can run during the invocation phase and up to 2 seconds during the shutdown phase.

Lambda Concurrency and Scaling

Default account concurrency limit is 1,000 concurrent executions across all functions per region (can be increased via Service Quotas).
Lambda scales by 1,000 concurrent executions every 10 seconds until the account limit is reached (12x faster than previous scaling model).
Each function scales independently from other functions in the same account.
Reserved Concurrency – Guarantees a set number of concurrent executions for a specific function; also acts as a maximum concurrency limit for that function.
Provisioned Concurrency – Pre-initializes execution environments to eliminate cold starts.
Maximum Concurrency for SQS – Allows setting a maximum number of concurrent function invocations when using SQS as an event source.
New AWS accounts may have reduced concurrency quotas that are raised automatically based on usage.

Lambda Best Practices

Lambda function code should be stateless and ensure there is no affinity between the code and the underlying compute infrastructure.
Instantiate AWS clients outside the scope of the handler to take advantage of connection re-use.
Make sure you have set +rx permissions on your files in the uploaded ZIP to ensure Lambda can execute code on your behalf.
Lower costs and improve performance by minimizing the use of startup code not directly related to processing the current event.
Use the built-in CloudWatch monitoring of the Lambda functions to view and optimize request latencies.
Delete old Lambda functions that you are no longer using.
Use Graviton2 (arm64) architecture for up to 34% better price-performance with minimal code changes.
Use SnapStart for Java, Python, and .NET functions to reduce cold start latency without additional cost.
Use Lambda Power Tuning to find the optimal memory configuration for cost and performance.
Include the AWS SDK in your deployment package rather than relying on the runtime-included version for version consistency.
Use structured JSON logging with Advanced Logging Controls for better observability.
Keep runtimes up to date – Lambda runtimes are deprecated when the underlying language version reaches end of community LTS support.

AWS Certification Exam Practice Questions

Questions are collected from Internet and the answers are marked as per my knowledge and understanding (which might differ with yours).

AWS services are updated everyday and both the answers and questions might be outdated soon, so research accordingly.

AWS exam questions are not updated to keep up the pace with AWS updates, so even if the underlying feature has changed the question might not be updated

Open to further feedback, discussion and correction.

Your serverless architecture using AWS API Gateway, AWS Lambda, and AWS DynamoDB experienced a large increase in traffic to a sustained 400 requests per second, and dramatically increased in failure rates. Your requests, during normal operation, last 500 milliseconds on average. Your DynamoDB table did not exceed 50% of provisioned throughput, and Table primary keys are designed correctly. What is the most likely issue?
1. Your API Gateway deployment is throttling your requests.
2. Your AWS API Gateway Deployment is bottlenecking on request (de)serialization.
3. You did not request a limit increase on concurrent Lambda function executions. (Lambda has a default account concurrency limit of 1,000. At 500 milliseconds per request, each concurrent execution handles 2 requests/second. The default 1,000 concurrency supports 2,000 requests/second, which exceeds 400 rps. However, new accounts may have lower limits. The key concept is understanding Lambda concurrency limits and requesting increases when needed.)
4. You used Consistent Read requests on DynamoDB and are experiencing semaphore lock.
A company wants to reduce cold start latency for their Java-based Lambda functions that power a latency-sensitive API. Which TWO approaches can help? (Choose 2)
1. Enable Lambda SnapStart on the function
2. Configure Provisioned Concurrency for the function
3. Increase the function’s memory allocation to 10 GB
4. Switch the function to asynchronous invocation
5. Enable Lambda Extensions for the function
(SnapStart reduces cold starts by up to 10x by caching initialized snapshots. Provisioned Concurrency keeps environments pre-initialized. Both eliminate cold start latency.)
A team needs to build a multi-step workflow that involves calling multiple APIs, waiting for human approval (which may take days), and then processing the results. The team wants to minimize infrastructure management. Which Lambda capability is most suitable?
1. Lambda Provisioned Concurrency
2. Lambda Managed Instances
3. Lambda Durable Functions
4. AWS Step Functions with Lambda
(Lambda Durable Functions can checkpoint progress, suspend execution for up to one year during waits like human approvals, and automatically recover from failures—all without additional infrastructure. While Step Functions can also orchestrate workflows, Durable Functions provide this within the Lambda programming model itself.)
A company has steady-state Lambda workloads processing 10,000 requests per second. They want to reduce costs while maintaining the serverless programming model. What should they use?
1. Lambda Provisioned Concurrency with Compute Savings Plans
2. Lambda Managed Instances with EC2 Reserved Instances or Compute Savings Plans
3. Migrate to Amazon ECS with Fargate
4. Use Lambda with Graviton2 architecture only
(Lambda Managed Instances allow using EC2 commitment-based pricing (Savings Plans, Reserved Instances) for up to 72% discount while maintaining the Lambda programming model and serverless operational simplicity.)
Which of the following is NOT a supported Lambda runtime as of 2025?
1. Python 3.13
2. Node.js 22
3. Java 21
4. Go 1.x managed runtime
(The Go 1.x managed runtime was deprecated on Jan 8, 2024. Go functions should now use the OS-only runtime provided.al2023 with a custom runtime approach.)
A Lambda function processes files uploaded to S3 and writes processed results back to the same S3 bucket. The team notices unexpectedly high Lambda invocations and costs. What AWS feature helps prevent this?
1. Lambda Reserved Concurrency
2. S3 Event notification filtering
3. Lambda Recursive Loop Detection
4. Lambda function timeout configuration
(Lambda Recursive Loop Detection automatically detects and stops recursive invocations between Lambda and supported services including S3, SQS, and SNS after 16 invocations, preventing runaway costs.)
A company wants their Lambda functions to send response data progressively to clients as it becomes available, rather than waiting for the entire response. Which feature should they use?
1. Lambda Provisioned Concurrency
2. Lambda Response Streaming
3. Lambda Function URLs with buffered response
4. Lambda Asynchronous Invocation with Destinations
(Lambda Response Streaming allows sending response data to callers as it becomes available, supporting larger payloads and enabling progressive rendering for web applications.)