AWS AI & Machine Learning Services

Amazon Bedrock
- is a fully managed service providing access to high-performing foundation models (FMs) from leading AI companies (GA September 2023).
- offers foundation models from AI21 Labs, Amazon, Anthropic, Cohere, Meta, Mistral AI, and Stability AI through a unified API.
- enables building and scaling generative AI applications without managing infrastructure.
- supports model customization with your own data while maintaining data privacy and security.
- provides serverless experience with pay-per-use pricing.
- includes capabilities for text generation, chat, image generation, and embeddings.
- supports Retrieval Augmented Generation (RAG) with knowledge bases.
- provides AI agents for multi-step task automation (2024).
- ensures data is not used to train base models and remains within your AWS environment.
Amazon Q Developer (formerly CodeWhisperer)
- is a generative AI-powered coding assistant for software developers (rebranded April 2024).
- provides real-time code suggestions, completions, and generation based on comments and existing code.
- supports multiple programming languages including Python, Java, JavaScript, TypeScript, C#, Go, Rust, PHP, Ruby, Kotlin, C, C++, Shell, SQL, and more.
- integrates with popular IDEs including VS Code, IntelliJ IDEA, PyCharm, WebStorm, and AWS Cloud9.
- performs security scanning to identify and suggest fixes for vulnerabilities.
- provides code explanations and documentation generation.
- assists with debugging, upgrading applications, and troubleshooting.
- tracks open-source code references and license information.
- offers free tier for individual developers and paid tier for professional use.
Amazon Q Business
- is a generative AI-powered assistant for enterprise use (GA April 2024).
- answers questions, provides summaries, generates content, and completes tasks based on enterprise data.
- connects to 40+ enterprise data sources including S3, SharePoint, Salesforce, ServiceNow, Jira, and more.
- respects existing access controls and permissions from connected data sources.
- provides conversational interface for employees to access company information.
- enables custom plugins for extending functionality.
- supports Amazon Q Apps for creating AI-powered applications from conversations.
- ensures enterprise data privacy and security with data isolation.
Amazon SageMaker
- Build, train, and deploy machine learning models at scale
- fully-managed service that enables data scientists and developers to quickly and easily build, train & deploy machine learning models.
- enables developers and scientists to build machine learning models for use in intelligent, predictive apps.
- is designed for high availability with no maintenance windows or scheduled downtimes.
- allows users to select the number and type of instance used for the hosted notebook, training & model hosting.
- can be deployed as endpoint interfaces and batch.
- supports Canary deployment using ProductionVariant and deploying multiple variants of a model to the same SageMaker HTTPS endpoint.
- supports Jupyter notebooks.
- Users can persist their notebook files on the attached ML storage volume.
- Users can modify the notebook instance and select a larger profile through the SageMaker console, after saving their files and data on the attached ML storage volume.
- includes built-in algorithms for linear regression, logistic regression, k-means clustering, principal component analysis, factorization machines, neural topic modeling, latent dirichlet allocation, gradient boosted trees, seq2seq, time series forecasting, word2vec & image classification
- algorithms work best when using the optimized protobuf recordIO format for the training data, which allows Pipe mode that streams data directly from S3 and helps faster start times and reduce space requirements
- provides built-in algorithms, pre-built container images, or extend a pre-built container image and even build your custom container image.
- supports users custom training algorithms provided through a Docker image adhering to the documented specification.
- also provides optimized MXNet, Tensorflow, Chainer & PyTorch containers
- ensures that ML model artifacts and other system artifacts are encrypted in transit and at rest.
- requests to the API and console are made over a secure (SSL) connection.
- stores code in ML storage volumes, secured by security groups and optionally encrypted at rest.
- SageMaker Neo is a new capability that enables machine learning models to train once and run anywhere in the cloud and at the edge.
Amazon SageMaker Studio
- is a unified web-based IDE for machine learning development (launched December 2019, updated November 2023).
- provides single interface to perform all ML development steps including data preparation, model building, training, and deployment.
- offers choice of IDEs including JupyterLab, Code Editor (based on VS Code OSS), and RStudio.
- enables quick IDE launch with on-demand compute resource scaling.
- improves data science team productivity by up to 10x.
- provides complete access, control, and visibility into each ML development step.
- supports collaboration with shared notebooks and experiments.
- integrates with all SageMaker capabilities including Canvas, Clarify, and HyperPod.
- Note: Previous experience renamed to “SageMaker Studio Classic” as of November 30, 2023.
Amazon SageMaker Canvas
- is a no-code machine learning service for business analysts (launched November 2021).
- enables building accurate ML models without writing code or requiring ML expertise.
- provides visual, point-and-click interface for data preparation and model building.
- supports tabular, image, and text data for predictions.
- connects to 50+ data sources including S3, Redshift, Snowflake, and SaaS applications.
- offers ready-to-use ML models and custom model building capabilities.
- includes generative AI capabilities (October 2023) for text generation, summarization, and content creation.
- provides automated feature engineering, algorithm selection, and hyperparameter tuning.
- enables one-click model deployment and batch predictions.
- supports collaboration between business analysts and data scientists.
Amazon SageMaker Clarify
- provides bias detection and model explainability capabilities (launched December 2020).
- identifies biases in training data and ML models across different groups (age, gender, income, etc.).
- detects potential bias during data preparation, after model training, and in deployed models.
- generates detailed reports quantifying different types of possible bias.
- provides feature importance graphs to explain model predictions.
- integrates with SageMaker Data Wrangler for bias detection during data preparation.
- supports continuous monitoring of deployed models for bias drift.
- helps meet regulatory requirements and ethical AI standards.
- produces reports for internal presentations and compliance documentation.
Amazon SageMaker HyperPod
- is purpose-built infrastructure for distributed training at scale (GA November 2023).
- reduces time to train foundation models by up to 40% with optimized infrastructure.
- supports GPU-based and AWS Trainium-based instances for cost-effective training.
- provides automated cluster health monitoring and node replacement.
- enables training for weeks or months with automated resiliency.
- automatically saves checkpoints and resumes training from last checkpoint on failure.
- efficiently distributes models and data across thousands of compute resources.
- includes preconfigured distributed training libraries for popular frameworks.
- provides recipes for accelerating foundation model training and fine-tuning.
- offers flexible training plans to meet timelines and budgets.
Amazon Textract
- Textract provides OCR and helps add document text detection and analysis to the applications.
- includes simple, easy-to-use API operations that can analyze image files and PDF files.
Amazon Comprehend
- Comprehend is a managed natural language processing (NLP) service to find insights and relationships in text.
- identifies the language of the text; extracts key phrases, places, people, brands, or events; understands how positive or negative the text is; analyzes text using tokenization and parts of speech; and automatically organizes a collection of text files by topic.
- can analyze a collection of documents and other text files (such as social media posts) and automatically organize them by relevant terms or topics.
Amazon Lex
- is a service for building conversational interfaces using voice and text.
- provides the advanced deep learning functionalities of automatic speech recognition (ASR) for converting speech to text, and natural language understanding (NLU) to recognize the intent of the text, to enable building applications with highly engaging user experiences and lifelike conversational interactions.
- common use cases of Lex include: Application/Transactional bot, Informational bot, Enterprise Productivity bot, and Device Control bot.
- leverages Lambda for Intent fulfillment, Cognito for user authentication & Polly for text-to-speech.
- scales to customers’ needs and does not impose bandwidth constraints.
- is a completely managed service so users don’t have to manage the scaling of resources or maintenance of code.
- uses deep learning to improve over time.
Amazon Polly
- text into speech
- uses advanced deep-learning technologies to synthesize speech that sounds like a human voice.
- supports Lexicons to customize pronunciation of specific words & phrases
- supports Speech Synthesis Markup Language (SSML) tags like prosody so users can adjust the speech rate, pitch, pauses, or volume.
Amazon Rekognition
- analyzes image and video
- identify objects, people, text, scenes, and activities in images and videos, as well as detect any inappropriate content.
- provides highly accurate facial analysis and facial search capabilities that can be used to detect, analyze, and compare faces for a wide variety of user verification, people counting, and public safety use cases.
- helps identify potentially unsafe or inappropriate content across both image and video assets and provides detailed labels that help accurately control what you want to allow based on your needs.
- provides Rekognition Custom Labels (launched December 2019) – an AutoML feature to build custom ML models for detecting specific objects and scenes unique to business needs.
- Custom Labels requires as few as 10 sample images per label to train custom models.
- Custom Labels automatically selects optimal ML algorithms and trains models without requiring ML expertise.
- enables identifying business-specific items like machine parts, product defects, or brand logos.
Amazon Forecast
- Amazon Forecast is a fully managed time-series forecasting service that uses statistical and machine learning algorithms to deliver highly accurate time-series forecasts and is built for business metrics analysis.
- automatically tracks the accuracy of the model over time as new data is imported. Model’s deviation from initial quality metrics can be systematically quantified and used to make more informed decisions about keeping, retraining, or rebuilding the model as new data comes in.
- provides six built-in algorithms which include ARIMA, Prophet, NPTS, ETS, CNN-QR, and DeepAR+.
- integrates with AutoML to choose the optimal model for the datasets.
Amazon SageMaker Ground Truth
- helps build highly accurate training datasets for machine learning quickly.
- offers easy access to labelers through Amazon Mechanical Turk and provides them with built-in workflows and interfaces for common labeling tasks.
- allows using your own labelers or use vendors recommended by Amazon through AWS Marketplace.
- helps lower labeling costs by up to 70% using automatic labeling, which works by training Ground Truth from data labeled by humans so that the service learns to label data independently.
- provides annotation consolidation to help improve the accuracy of the data object’s labels.
Amazon Translate
- provides natural and fluent language translation
- a neural machine translation service that delivers fast, high-quality, and affordable language translation.
- Neural machine translation is a form of language translation automation that uses deep learning models to deliver more accurate and natural-sounding translation than traditional statistical and rule-based translation algorithms.
- allows content localization – such as websites and applications – for international users, and to easily translate large volumes of text efficiently.
Amazon Transcribe
- provides speech-to-text capability
- uses a deep learning process called automatic speech recognition (ASR) to convert speech to text quickly and accurately.
- can be used to transcribe customer service calls, automate closed captioning and subtitling, and generate metadata for media assets to create a fully searchable archive.
- adds punctuation and formatting so that the output closely matches the quality of manual transcription at a fraction of the time and expense.
- process audio in batch or near real-time.
- supports automatic language identification.
- supports custom vocabulary to generate more accurate transcriptions for domain-specific words and phrases like product names, technical terminology, or names of individuals.
- supports specifying a list of words to remove from transcripts.
- provides Transcribe Call Analytics (launched August 2021) for extracting insights from customer conversations.
- Call Analytics generates turn-by-turn transcripts with speaker identification and sentiment analysis.
- supports real-time Call Analytics (November 2022) for live conversation insights and agent assistance.
- provides Transcribe Medical for healthcare and medical transcription with HIPAA eligibility.
Amazon Kendra
- is an intelligent search service that uses NLP and advanced ML algorithms to return specific answers to search questions from your data.
- uses its semantic and contextual understanding capabilities to decide whether a document is relevant to a search query.
- returns specific answers to questions, giving users an experience that’s close to interacting with a human expert.
- provides a unified search experience by connecting multiple data repositories to an index and ingesting and crawling documents.
- can use the document metadata to create a feature-rich and customized search experience for the users, helping them efficiently find the right answers to their queries.
Augmented AI (Amazon A2I)
- Augmented AI (Amazon A2I) is an ML service that makes it easy to build the workflows required for human review.
- brings human review to all developers, removing the undifferentiated heavy lifting associated with building human review systems or managing large numbers of human reviewers, whether it runs on AWS or not.
Amazon Personalize
- Personalize is a fully managed machine learning service that uses data to generate item recommendations.
- can also generate user segments based on the users’ affinity for certain items or item metadata.
- generates recommendations primarily based on item interaction data that comes from the users interacting with items in the catalog.
- includes API operations for real-time personalization, and batch operations for bulk recommendations and user segments.
Amazon Panorama
- brings computer vision to the on-premises camera network.
- AWS Panorama Appliance or another compatible device can be installed in the data center and registered with AWS Panorama to deploy computer vision applications from the cloud.
- AWS Panorama Appliance
- is a compact edge appliance that uses a powerful system-on-module (SOM) that is optimized for ML workloads.
- can run multiple computer vision models against multiple video streams in parallel and output the results in real-time.
- is designed for use in commercial and industrial settings and is rated for dust and liquid protection.
- works with the existing real-time streaming protocol (RTSP) network cameras.
Amazon Fraud Detector
- Fraud Detector is a fully managed service to identify potentially fraudulent online activities such as online payment fraud and fake account creation.
- takes care of all the heavy lifting such as data validation and enrichment, feature engineering, algorithm selection, hyperparameter tuning, and model deployment.
AWS IoT Greengrass ML Inference
- IoT Greengrass helps perform machine learning inference locally on devices, using models that are created, trained, and optimized in the cloud.
- provides flexibility to use machine learning models trained in SageMaker or to bring your pre-trained model stored in S3.
- helps get inference results with very low latency to ensure the IoT applications can respond quickly to local events.
Amazon Elastic Inference
- helps attach low-cost GPU-powered acceleration to EC2 and SageMaker instances or ECS tasks to reduce the cost of running deep learning inference by up to 75%.
- supports TensorFlow, Apache MXNet, and ONNX models, with more frameworks coming soon.
AWS Certification Exam Practice Questions
- Questions are collected from Internet and the answers are marked as per my knowledge and understanding (which might differ with yours).
- AWS services are updated everyday and both the answers and questions might be outdated soon, so research accordingly.
- AWS exam questions are not updated to keep up the pace with AWS updates, so even if the underlying feature has changed the question might not be updated
- Open to further feedback, discussion and correction.
- A company has built a deep learning model and now wants to deploy it using the SageMaker Hosting Services. For inference, they want a cost-effective option that guarantees low latency but still comes at a fraction of the cost of using a GPU instance for your endpoint. As a machine learning Specialist, what feature should be used?
- Inference Pipeline
- Elastic Inference
- SageMaker Ground Truth
- SageMaker Neo
- A machine learning specialist works for an online retail company that sells health products. The company allows users to enter reviews of the products they buy from the website. The company wants to make sure the reviews do not contain any offensive or unsafe content, such as obscenities or threatening language. Which Amazon SageMaker algorithm or service will allow scanning user’s review text in the simplest way?
- BlazingText
- Transcribe
- Semantic Segmentation
- Comprehend
- A company develops a tool whose coverage includes blogs, news sites, forums, videos, reviews, images, and social networks such as Twitter and Facebook. Users can search data by using Text and Image Search, and use charting, categorization, sentiment analysis, and other features to provide further information and analysis. They want to provide Image and text analysis capabilities to the applications which include identifying objects, people, text, scenes, and activities, and also provide highly accurate facial analysis and facial recognition. What service can provide this capability?
- Amazon Comprehend
- Amazon Rekognition
- Amazon Polly
- Amazon SageMaker
- A company wants to build generative AI applications using foundation models without managing infrastructure. Which service should they use?
- Amazon SageMaker
- Amazon Comprehend
- Amazon Bedrock
- Amazon Lex
- A development team wants an AI assistant that provides real-time code suggestions and security scanning in their IDE. Which service should they use?
- Amazon CodeGuru
- Amazon Q Developer
- AWS Cloud9
- Amazon SageMaker
- A business analyst with no ML experience wants to build accurate ML models using a visual interface. Which service should they use?
- Amazon SageMaker Studio
- Amazon SageMaker Canvas
- Amazon Forecast
- Amazon Personalize
- A company needs to detect bias in their ML models and explain predictions for regulatory compliance. Which service should they use?
- Amazon SageMaker Ground Truth
- Amazon Inspector
- Amazon SageMaker Clarify
- AWS Audit Manager
- A company wants to train large foundation models for weeks with automated resiliency and checkpoint management. Which service should they use?
- Amazon SageMaker Training Jobs
- Amazon SageMaker HyperPod
- AWS Batch
- Amazon EC2 with GPU instances
- A contact center wants real-time insights from customer calls including sentiment analysis and agent assistance. Which service should they use?
- Amazon Transcribe
- Amazon Transcribe Call Analytics
- Amazon Comprehend
- Amazon Connect
- A company wants to build custom image recognition models to identify specific machine parts with minimal training data. Which service should they use?
- Amazon Rekognition (standard)
- Amazon Rekognition Custom Labels
- Amazon SageMaker
- Amazon Textract