We’re excited to announce AWS Glue Data Catalog usage metrics, a new feature that provides native integration with Amazon CloudWatch. This feature gives you immediate visibility into your AWS Glue Data Catalog API usage patterns and trends.
AWS Glue Data Catalog is a centralized repository that stores metadata about your organization’s datasets. With its unified interface that acts as an index, you can store and query information about your data sources, including their location, formats, schemas, and runtime metrics.
As you scale your lakehouse architecture on Amazon Web Services (AWS) and maintain reliable data operations, observability and monitoring become essential to understanding and optimizing Data Catalog API usage.
With Data Catalog usage metrics in CloudWatch, you can achieve the following:
Monitor API call patterns at 1-minute intervals
Proactively request service quota increases for API rate limits
Enable the CloudWatch built-in anomaly detection feature to identify abnormalities in your API usage
Understand lakehouse usage across more than 50 APIs
In this post, we demonstrate how to access these metrics, provide a step-by-step walkthrough, and set up meaningful alarms.
Access Data Catalog usage metrics in the Amazon CloudWatch console
To access Data Catalog usage metrics, complete the following steps:
Open the Amazon CloudWatch console
Under Metrics, choose All metrics
In the search bar, enter Glue and press Enter
Choose Usage > By AWS Resource, as shown in the following screenshot
The Metrics section opens and displays the different catalog usage metrics that you can select from to create dashboards and alarms, as shown in the following screenshot
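The console steps above can also be approximated programmatically. The following is a minimal sketch, assuming Python with boto3 installed and AWS credentials configured. The helper names are hypothetical, and matching the Service dimension on the substring "Glue" is an assumption that mirrors the search-bar step. It lists CallCount metrics in the AWS/Usage namespace and extracts the API name from each one.

```python
from typing import Any, Dict, Iterable, List


def build_list_metrics_request() -> Dict[str, Any]:
    """Parameters for CloudWatch ListMetrics, narrowed to usage CallCount metrics."""
    return {"Namespace": "AWS/Usage", "MetricName": "CallCount"}


def glue_api_names(metrics: Iterable[Dict[str, Any]]) -> List[str]:
    """Return the sorted API names (Resource dimension) of Glue usage metrics.

    Assumption: the Service dimension value contains "Glue", which mirrors
    searching for "Glue" in the console.
    """
    names = set()
    for metric in metrics:
        dims = {d["Name"]: d["Value"] for d in metric.get("Dimensions", [])}
        if "Glue" in dims.get("Service", "") and "Resource" in dims:
            names.add(dims["Resource"])
    return sorted(names)


def list_glue_usage_apis() -> List[str]:
    """Call CloudWatch and return the Glue API names that have usage metrics."""
    import boto3  # imported lazily so the helpers above work without boto3

    cloudwatch = boto3.client("cloudwatch")
    paginator = cloudwatch.get_paginator("list_metrics")
    metrics: List[Dict[str, Any]] = []
    for page in paginator.paginate(**build_list_metrics_request()):
        metrics.extend(page["Metrics"])
    return glue_api_names(metrics)
```

Calling list_glue_usage_apis() requires permission for the cloudwatch:ListMetrics action. CloudWatch generally lists a metric only after it has received data, so APIs that haven't been called recently might not appear.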
Monitor CallCount metrics
Each Amazon CloudWatch metric for Data Catalog is of type API and is named CallCount. This means that each API call on that specific resource (for example, the GetConnection API) is logged as one count. These metrics integrate into your existing CloudWatch dashboards, or you can use them to create new ones. For proactive monitoring, you can configure custom alarms that trigger automatically when API usage exceeds your defined thresholds, helping you stay within service limits.
Under the Graphed metrics tab, you can apply additional customizations to match your monitoring needs. In the Details column, you can create alarms and enable anomaly detection to identify unusual patterns.
To help with effective API monitoring, CallCount metrics specifically focus on successful API calls. This way, you have more precise monitoring and can troubleshoot different types of API behaviors. The following screenshot shows the AWS Glue usage metrics view for the GetTables API.
In the Statistics column, you can view your API usage beyond the default Sum, Min, and Max metrics. You can now choose from a wide variety of statistical methods to analyze your usage patterns, as shown in the following screenshot.
Metrics and dimensions for Data Catalog usage metrics
Data Catalog usage metrics use the AWS/Usage namespace and provide CallCount metrics. These metrics are published with the dimensions Service, Resource, Type, and Class.
The CallCount metric doesn’t have a specified unit. The most useful statistic for the metric is SUM, which represents the total operation count for the 1-minute period. Note that the metric value is emitted at 1-minute intervals. Reducing the period further (for example, to 1 second) won’t change the emission interval.
Metrics
Metric | Description
CallCount | The number of specified operations performed in your account.
Dimensions
Dimension key | Dimension value | Description
Service | AWS Glue | The name of the AWS service containing the resource. For Data Catalog usage metrics, the value for this dimension is AWS Glue.
Type | API | The type of resource being tracked. Currently, when the Service dimension is AWS Glue, the only valid value for Type is API.
Resource | API operation name | The name of the API operation. Valid values include the following: GetCatalogs, GetCatalog, GetDatabases, GetDatabase, GetTables, GetTable, GetTableVersion, GetTableVersions, SearchTables, GetPartitionIndexes, GetColumnStatisticsForTable, GetPartition, GetPartitions, BatchGetPartition, GetColumnStatisticsForPartition, GetConnection, GetConnections, GetUserDefinedFunction, GetUserDefinedFunctions, GetCatalogImportStatus, GetTableOptimizer, BatchGetTableOptimizer, ListTableOptimizerRuns, CreateCatalog, CreateDatabase, CreateTable, CreatePartitionIndex, CreatePartition, BatchCreatePartition, CreateConnection, CreateUserDefinedFunction, CreateTableOptimizer, UpdateCatalog, UpdateDatabase, UpdateTable, UpdateColumnStatisticsForTable, UpdatePartition, BatchUpdatePartition, UpdateColumnStatisticsForPartition, UpdateConnection, UpdateUserDefinedFunction, UpdateTableOptimizer, DeleteCatalog, DeleteDatabase, DeleteTable, BatchDeleteTable, DeleteTableVersion, DeletePartitionIndex, DeleteColumnStatisticsForTable, DeletePartition, BatchDeletePartition, DeleteColumnStatisticsForPartition, DeleteConnection, BatchDeleteConnection, DeleteUserDefinedFunction, DeleteTableOptimizer, TestConnection, ImportCatalogToGlue
Class | None | The class of resource being tracked. Data Catalog usage metrics use this dimension with a value of None.
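To illustrate how the namespace, metric name, and dimensions fit together, the following is a minimal sketch of a CloudWatch GetMetricData query, assuming Python with boto3 and configured credentials. The function names are hypothetical, and the default Service value is copied from the dimensions described in this post; if no data comes back, compare it against the dimension values that the CloudWatch console shows for your account.

```python
from datetime import datetime, timedelta, timezone
from typing import Any, Dict, List, Tuple


def call_count_dimensions(api_name: str, service: str = "AWS Glue") -> List[Dict[str, str]]:
    """Build the four dimensions that identify one API's CallCount metric."""
    return [
        {"Name": "Service", "Value": service},  # assumption: value as listed in this post
        {"Name": "Type", "Value": "API"},
        {"Name": "Resource", "Value": api_name},
        {"Name": "Class", "Value": "None"},
    ]


def build_call_count_query(api_name: str, stat: str = "Sum",
                           period_seconds: int = 60,
                           service: str = "AWS Glue") -> Dict[str, Any]:
    """Build one MetricDataQueries entry for GetMetricData."""
    return {
        "Id": "callcount",  # query IDs must start with a lowercase letter
        "MetricStat": {
            "Metric": {
                "Namespace": "AWS/Usage",
                "MetricName": "CallCount",
                "Dimensions": call_count_dimensions(api_name, service),
            },
            "Period": period_seconds,  # the metric is emitted once per minute
            "Stat": stat,
        },
        "ReturnData": True,
    }


def fetch_call_counts(api_name: str, hours: int = 1) -> List[Tuple[datetime, float]]:
    """Fetch per-minute call counts for the last `hours` hours."""
    import boto3  # imported lazily so the builders above work without boto3

    cloudwatch = boto3.client("cloudwatch")
    end = datetime.now(timezone.utc)
    response = cloudwatch.get_metric_data(
        MetricDataQueries=[build_call_count_query(api_name)],
        StartTime=end - timedelta(hours=hours),
        EndTime=end,
    )
    # Short time ranges fit in one response; longer ranges may need NextToken paging.
    result = response["MetricDataResults"][0]
    return list(zip(result["Timestamps"], result["Values"]))
```

For example, fetch_call_counts("GetTables") returns (timestamp, count) pairs for the last hour. Passing a different stat to build_call_count_query, such as "p99", requests one of the other statistics that CloudWatch supports.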
Set up CloudWatch alarms for Data Catalog usage metrics
Data Catalog has defined rules to manage atypical usage patterns that limit the client call rate at the granularity of requests per second. You can create CloudWatch alarms using the CallCount metric so that you can request limit increases proactively. To configure such a CloudWatch alarm, complete the following steps:
On the CloudWatch metrics console, select one of the available metrics, as shown in the following screenshot. In this example, we select the resource GetTables. You can select multiple metrics to fit your use case.
Choose Graphed metrics.
Choose Sum as the primary statistic.
Set the period to 1 minute.
Choose Details and then Create alarm.
For Threshold type, choose Anomaly detection. You can also choose Static, based on your requirements, after you’ve determined a specific threshold value.
Set the Anomaly detection threshold to 2 (default). The threshold value is used to determine the normal range of values for the metric. A higher value produces a thicker band of normal values. For more information, refer to How CloudWatch anomaly detection works.
Choose Next.
For Send a notification to the following SNS topic, choose Create new topic.
For Create a new topic, enter your Amazon Simple Notification Service (Amazon SNS) topic name.
For Email endpoints that will receive the notification, enter your email address. In this example, we create a new SNS topic. However, you can use your existing SNS topics or use other options such as AWS Lambda or an Auto Scaling action.
Choose Create topic.
Scroll down and choose Next.
Enter an alarm name and a description and choose Next.
Review all the details you’ve entered and choose Create alarm, as shown in the following screenshot.
By following these steps, you’ve successfully configured a CloudWatch alarm using anomaly detection that monitors your Data Catalog usage with the threshold that you set. The alarm triggers when the CallCount metric exceeds the calculated threshold, sending notifications to your specified SNS topic and email endpoints.
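A comparable alarm can be sketched in code. The following is a minimal example, assuming Python with boto3 and an SNS topic that already exists. The function names, the alarm name, and the single evaluation period are illustrative choices, and the Service dimension value is an assumption that you should compare against the value the CloudWatch console shows. The request uses the ANOMALY_DETECTION_BAND metric math function with a band width of 2 and alarms when the 1-minute Sum rises above the upper edge of the band.

```python
from typing import Any, Dict


def build_anomaly_alarm_request(api_name: str, topic_arn: str,
                                band_width: float = 2,
                                service: str = "AWS Glue") -> Dict[str, Any]:
    """Build PutMetricAlarm parameters for an anomaly detection alarm on CallCount."""
    metric = {
        "Namespace": "AWS/Usage",
        "MetricName": "CallCount",
        "Dimensions": [
            {"Name": "Service", "Value": service},  # assumption: verify in the console
            {"Name": "Type", "Value": "API"},
            {"Name": "Resource", "Value": api_name},
            {"Name": "Class", "Value": "None"},
        ],
    }
    return {
        "AlarmName": f"glue-{api_name}-callcount-anomaly",  # hypothetical naming scheme
        "AlarmDescription": f"{api_name} call count is above the expected band",
        "ComparisonOperator": "GreaterThanUpperThreshold",
        "EvaluationPeriods": 1,
        "ThresholdMetricId": "band",  # compare m1 against the band expression below
        "AlarmActions": [topic_arn],
        "Metrics": [
            {
                "Id": "m1",
                "MetricStat": {"Metric": metric, "Period": 60, "Stat": "Sum"},
                "ReturnData": True,
            },
            {
                "Id": "band",
                "Expression": f"ANOMALY_DETECTION_BAND(m1, {band_width:g})",
                "Label": "CallCount (expected)",
                "ReturnData": True,
            },
        ],
    }


def create_anomaly_alarm(api_name: str, topic_arn: str) -> None:
    """Create or update the alarm in the current account and Region."""
    import boto3  # imported lazily so the builder above works without boto3

    cloudwatch = boto3.client("cloudwatch")
    cloudwatch.put_metric_alarm(**build_anomaly_alarm_request(api_name, topic_arn))
```

To alarm on values outside the band in either direction, the ComparisonOperator can be set to LessThanLowerOrGreaterThanUpperThreshold instead.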
This proactive monitoring approach helps prevent API rate limit issues and supports smooth operation of your Data Catalog usage. For more information on using CloudWatch alarms, refer to Using Amazon CloudWatch alarms.
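Because rate limits apply per second while CallCount is summed per minute, it can help to convert between the two when choosing a static threshold. The following sketch shows the arithmetic only. The function names, the limit of 50 requests per second, and the 80 percent fraction are hypothetical values chosen for illustration and are not actual AWS Glue quotas.

```python
def average_calls_per_second(minute_sum: float, period_seconds: int = 60) -> float:
    """Average call rate implied by a Sum datapoint over one period."""
    return minute_sum / period_seconds


def static_sum_threshold(limit_per_second: float, fraction: float = 0.8,
                         period_seconds: int = 60) -> float:
    """Sum value per period that corresponds to a fraction of a per-second limit."""
    return limit_per_second * period_seconds * fraction


if __name__ == "__main__":
    # 1,200 calls recorded in one minute average out to 20 calls per second.
    print(average_calls_per_second(1200))
    # Static alarm threshold at 80% of a hypothetical limit of 50 requests per second.
    print(static_sum_threshold(50))
```

A 1-minute Sum gives only the average rate over that minute, so short bursts within the minute can exceed a per-second limit even when the average looks safe. Leaving headroom in a static threshold accounts for this.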
Conclusion
AWS Glue Data Catalog usage metrics are an effective enhancement to your data infrastructure monitoring capabilities. They address the growing need for detailed observability through Amazon CloudWatch in modern data architectures built on top of Data Catalog. You now have access to more granular statistics, moving beyond simple maximum and average request metrics to comprehensive performance indicators, including p99 percentiles. These metrics are emitted at 1-minute intervals, providing visibility into your Data Catalog operations. Organizations can now proactively identify bottlenecks before they affect operations and efficiently conduct capacity planning using detailed usage patterns.
From building monitoring dashboards to setting up alerts, the native support for CloudWatch anomaly detection and flexible alarm configurations makes it straightforward to proactively monitor your lakehouse deployment and catch abnormalities in your lakehouse usage. For more information, refer to Monitoring Data Catalog usage metrics in Amazon CloudWatch in the AWS Glue documentation. We recommend testing and using these metrics as part of your modern monitoring and observability strategy. We encourage you to share your feedback with us.
About the authors
David Zhang is an Analytics Solutions Architect specializing in designing and implementing large-scale data infrastructure, ETL processes, and extensive data management systems. He helps customers modernize data platforms on Amazon Web Services (AWS). David is also an active speaker at AWS events and a contributor to technical content and open source projects. He enjoys playing volleyball, tennis, and basketball during his free time.
Noritaka Sekiyama is a Principal Big Data Architect with Amazon Web Services (AWS) Analytics services. He’s responsible for building software artifacts to help customers. In his spare time, he enjoys cycling on his road bike.
Sandeep Adwankar is a Senior Product Manager at AWS. Based in the California Bay Area, he works with customers around the globe to translate business and technical requirements into products that enable customers to improve how they manage, secure, and access data.
Abhay Joshi is a Software Development Engineer at AWS Glue and AWS Lake Formation. He’s passionate about building fault-tolerant and reliable distributed systems at scale.