Amazon Athena makes it easy to analyze data without having to set up and manage data processing infrastructure. Traditionally, however, you needed to set up an Amazon Simple Storage Service (Amazon S3) bucket to store query results before you could run queries with Athena. The need arose to make it even simpler to start using Athena, with fewer setup steps.
That’s why we’re thrilled to introduce managed query results, a new Athena feature that automatically stores, secures, and manages the lifecycle of query result data for you at no additional cost. Managed query results simplifies your user experience by removing the need to create or choose an S3 bucket in your account to hold results before you run queries. It helps reduce your monthly cost by moving temporary storage of query results from your S3 bucket to Athena, and eliminates the need for separate processes to delete query result data from your S3 bucket after it’s no longer needed. Athena now offers both service managed, temporary result storage and customer managed Amazon S3 storage options to meet different needs.
What’s more, using managed query results doesn’t require complex changes to applications that read query results from existing Athena interfaces, and it increases data security. Access to managed query result data is now associated with AWS Identity and Access Management (IAM) permissions scoped to individual Athena workgroups, instead of S3 buckets. Additionally, you can automatically encrypt result data with AWS Key Management Service (AWS KMS) using AWS owned or customer managed keys.
In this post, we demonstrate how to get started with managed query results and, by removing the undifferentiated effort spent on query result management, how Athena helps you get insights from your data in fewer steps than before.
Solution overview
When you use managed query results, you no longer need to create and choose S3 buckets to store query results, or manage lifecycle rules to make sure the result data is eventually cleaned up. The following are some scenarios where this is useful:
- Financial analysts working in teams analyzing market data, each covering different investment areas or financial instruments, might use different workgroups for different types of analyses or projects. Now, analysts don’t need to spend time setting up S3 buckets or worry about cleaning up query results when their work is done.
- Compliance teams can run audit queries on transaction data for regulatory reporting while making sure only authorized team members can access sensitive query results through IAM permissions. Because query results are cleaned up automatically, the compliance team no longer requires separate processes to delete query result data.
- Data and analytics teams and platform automation teams who are responsible for streamlined onboarding of new users and teams no longer need to configure individual S3 buckets and permissions for different users and teams, which simplifies their automation code.
The following are some of the key features of managed query results in Athena:
- It removes the need to choose an S3 bucket location before you run queries.
- There is no additional cost to store your query results, and query results are automatically deleted after a period of time, reducing management overhead from separate bucket cleanup processes.
- It’s easy to get started: new and preexisting workgroups can be seamlessly configured to use managed query results. You can have a mix of Athena managed and customer managed query results in your AWS account.
- You can use streamlined IAM permissions, with access to read results using GetQueryResults and GetQueryResultsStream tied to individual workgroups.
- Query results are automatically encrypted with your choice of AWS owned or customer managed KMS keys.
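As a rough illustration of workgroup-scoped access, the following Python sketch builds an IAM policy document that allows reading results from a single workgroup only. The helper name, account ID, Region, and workgroup name are hypothetical placeholders, and your users will likely need additional Athena actions (for example, to start queries) that are not shown here.

```python
import json


def build_result_reader_policy(region: str, account_id: str, workgroup: str) -> dict:
    """Return an IAM policy document that allows reading query results
    from one Athena workgroup (hypothetical helper, not an AWS API)."""
    workgroup_arn = f"arn:aws:athena:{region}:{account_id}:workgroup/{workgroup}"
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "ReadResultsForOneWorkgroup",
                "Effect": "Allow",
                # The two read actions named in the feature list above.
                "Action": [
                    "athena:GetQueryResults",
                    "athena:GetQueryResultsStream",
                ],
                # Scoping to the workgroup ARN replaces S3 bucket permissions.
                "Resource": workgroup_arn,
            }
        ],
    }


if __name__ == "__main__":
    # Placeholder values for illustration only.
    policy = build_result_reader_policy("us-east-1", "111122223333", "demo-workgroup")
    print(json.dumps(policy, indent=2))
```

Because the resource is the workgroup rather than a bucket, granting or revoking access to results becomes a change to one policy statement per workgroup.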
Let’s walk through how to get started with managed query results.
1. Configure your workgroup
Complete the following steps to configure your workgroup:
- On the Athena console, choose Workgroups in the navigation pane.
- Choose Create workgroup.
Alternatively, you can select an existing workgroup from the list and choose Edit on the workgroup detail page.
- For Query result configuration, select Athena managed.
- For Encrypt query results, choose your preferred encryption method.
Figure 1: Query result configuration
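The same configuration can also be applied through the Athena API. The following is a minimal Python sketch, assuming a boto3 Athena client and assuming that the field names ManagedQueryResultsConfiguration, Enabled, EncryptionConfiguration, and KmsKey match the current Athena API reference. The helper names are hypothetical, so verify the request shape against the documentation before relying on it.

```python
from typing import Optional


def build_managed_results_configuration(kms_key_arn: Optional[str] = None) -> dict:
    """Build the workgroup configuration fragment that turns on managed
    query results (hypothetical helper).

    Assumption: these field names match the Athena API reference.
    """
    managed = {"Enabled": True}
    if kms_key_arn is not None:
        # Only set when encrypting with a customer managed key.
        managed["EncryptionConfiguration"] = {"KmsKey": kms_key_arn}
    return {"ManagedQueryResultsConfiguration": managed}


def create_managed_workgroup(athena_client, name: str,
                             kms_key_arn: Optional[str] = None) -> dict:
    """Create a workgroup that uses managed query results.

    athena_client is expected to be boto3.client("athena").
    """
    return athena_client.create_work_group(
        Name=name,
        Configuration=build_managed_results_configuration(kms_key_arn),
    )
```

Keeping the request builder separate from the API call makes the configuration easy to inspect or unit test without AWS credentials.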
2. Configure encryption
Choose your preferred encryption method for query results:
- Encrypt using an AWS owned key – This is the default option. It indicates that you want query results to be encrypted and decrypted by an AWS owned key.
- Encrypt using a customer managed key – Choose this option if you want to encrypt and decrypt query results with your own key. To have Athena use your customer managed key, specify the Athena service in the Principal element of the key policy. For more information, see Setup an AWS KMS key policy for managed storage. To run queries, the user querying data needs permission to access your key.
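To make the key policy requirement concrete, the following Python sketch assembles one key policy statement that names a service principal. It is an illustration under stated assumptions: the helper name is hypothetical, the service principal is passed in as a parameter rather than hardcoded because the exact value should be taken from the Athena documentation, and the listed KMS actions and the aws:SourceAccount condition are assumptions about what such a statement typically contains.

```python
def build_key_policy_statement(service_principal: str, key_arn: str,
                               account_id: str) -> dict:
    """Return a KMS key policy statement that lets a service principal
    use the key (hypothetical helper, not an AWS API).

    Assumptions: the actions and the condition below are illustrative;
    the documented policy may require different or additional elements.
    """
    return {
        "Sid": "AllowAthenaServiceToUseKey",
        "Effect": "Allow",
        # The Athena service goes in the Principal element of the statement.
        "Principal": {"Service": service_principal},
        "Action": ["kms:Decrypt", "kms:GenerateDataKey"],
        "Resource": key_arn,
        # Restrict use of the key to requests made on behalf of this account.
        "Condition": {"StringEquals": {"aws:SourceAccount": account_id}},
    }
```

A statement like this is added to the key policy alongside the existing statements; users who run queries still need their own permission to use the key.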
3. Query your data
After you’ve configured your workgroup for managed query results, you can immediately start running queries. Let’s run a sample query against the AWS Cost and Usage Report.
The Athena console banner indicates that our workgroup, demo-workgroup, was updated to use managed query results. Our query ran successfully, and we didn’t need to set up an S3 bucket. To download these results, choose Download results CSV.
Figure 2: Running a query against the Cost and Usage Report in the Athena console
You can access these results through the Athena console and by using the Athena APIs.
Figure 3: Accessing the query results via the Athena API
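For readers without the screenshots, the API flow can be sketched in Python as follows. This is a minimal sketch, assuming a boto3 Athena client and a workgroup that is already configured for managed query results. The function name and polling interval are hypothetical choices, and only the first page of results is read.

```python
import time


def run_query(athena_client, sql: str, workgroup: str,
              poll_seconds: float = 1.0) -> list:
    """Run sql in the given workgroup and return the first page of rows.

    athena_client is expected to be boto3.client("athena").
    """
    # No output location is passed: the workgroup is assumed to use
    # managed query results, so no S3 bucket is needed.
    query_id = athena_client.start_query_execution(
        QueryString=sql,
        WorkGroup=workgroup,
    )["QueryExecutionId"]

    # Poll until the query reaches a terminal state.
    while True:
        execution = athena_client.get_query_execution(QueryExecutionId=query_id)
        status = execution["QueryExecution"]["Status"]
        state = status["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        reason = status.get("StateChangeReason", "")
        raise RuntimeError(f"Query {query_id} ended in state {state}: {reason}")

    # Read results through the Athena API instead of from an S3 object.
    page = athena_client.get_query_results(QueryExecutionId=query_id)
    return [
        [cell.get("VarCharValue") for cell in row["Data"]]
        for row in page["ResultSet"]["Rows"]
    ]
```

For SELECT statements the first returned row typically holds the column names, so callers may want to treat it as a header. Larger result sets would need pagination, which this sketch leaves out.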
Conclusion
In this post, we introduced managed query results, a new Athena feature that streamlines the query experience through automated storage of query results, provides automatic cleanup, and limits query result access with IAM permissions. Managed query results reduces operational overhead, empowering both data analysts running interactive queries and teams building complex analytics pipelines to focus on deriving insights rather than managing infrastructure. We demonstrated how to configure workgroups for managed storage and effectively use this feature in query scenarios.
To start using managed query results with Athena, simply configure your workgroups through the Athena console or APIs. For more information, see Managed query results.
About the Authors
Guy Bachar is a Sr. Solutions Architect at AWS. He specializes in assisting capital markets and FinTech customers with their cloud transformation journeys. His expertise encompasses identity management, security, and unified communication.
Sayan Chakraborty is a Sr. Solutions Architect at AWS. He helps large enterprises build secure, scalable, and performant solutions on AWS. With a background in enterprise and technology architecture, he has experience delivering large-scale digital transformation programs across a wide range of industry verticals.
Darshit Thakkar is a Technical Product Manager at AWS and works out of Boston, Massachusetts. He works closely with customers to understand how they use data, and drives product innovations that make data more actionable at scale.