Cloud Native

How to Speed Up API Data Collection in Lambdas

Using a little-known module named requests_futures can dramatically speed up the consumption of remote APIs, even faster than preemptive multitasking.


Michele (Mike) Hjorleifsson | Dec 01 2022
7 min read

Overview

As more and more private and public APIs become available, we find ourselves consuming these APIs for a myriad of reasons. Of course, we want those to be as fast as possible, to reduce our compute costs and get our data collection completed more quickly. So why not test performance using an asynchronous library vs. the built-in standard multithreading and multiprocessing capabilities of Python? And while we are at it, let's make this test quick to iterate and deploy using another tool provided by AWS called Chalice.

What is Concurrent?

Concurrent (concurrent.futures) is a module that has been included in Python's standard library since version 3.2. Primarily used for parallel programming, it also has some interesting use cases for day-to-day projects, and it is invaluable when you need to process a bunch of data for different calculations. But is concurrent going to help with fetching API calls? After all, the majority of the time is spent waiting on the remote server's response, not crunching the resulting data.

Concurrent Use Cases

Preemptive multitasking
- Allows the operating system to decide when to switch between tasks, external to Python itself (e.g. while requesting data from an API endpoint)
Cooperative multitasking
- The tasks themselves decide when to give up control (e.g. asyncio calls)
Multiprocessing
- Processes execute on all processors simultaneously
- NOT currently supported in AWS Lambda (you will get an OS "not implemented" error)
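As a rough illustration of the preemptive case, the sketch below pushes a few simulated API calls through concurrent.futures.ThreadPoolExecutor and compares that with a plain for loop. The fake_api_call function, its 0.2-second sleep, and the country list are stand-ins invented for this example; they only show how threads can overlap their waiting time and say nothing about real API timings.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_api_call(country):
    """Hypothetical stand-in for a slow remote API: sleeps, then returns."""
    time.sleep(0.2)  # simulated network wait; the thread is idle here
    return f"results for {country}"


def fetch_sequential(countries):
    # Plain for loop: each call waits for the previous one to finish.
    return [fake_api_call(c) for c in countries]


def fetch_threaded(countries, max_workers=5):
    # The OS switches between threads while each one waits on I/O.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(fake_api_call, countries))


def timed(func, *args):
    # Return the function's result together with its wall-clock duration.
    start = time.perf_counter()
    result = func(*args)
    return result, time.perf_counter() - start


if __name__ == "__main__":
    countries = ["Canada", "France", "Japan", "Brazil", "Kenya"]
    _, seq = timed(fetch_sequential, countries)
    _, thr = timed(fetch_threaded, countries)
    print(f"sequential: {seq:.2f}s, threaded: {thr:.2f}s")
```

pool.map returns results in input order, so the threaded version is a drop-in replacement for the loop.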

Async Use Cases

Fetching web content without waiting for results
- Fire and forget method, send your request then fire more requests and store the data as it returns.
- Widely used in JavaScript.
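The fire-and-forget idea can be sketched with Python's built-in asyncio, where each task gives up control at an await. As before, fake_api_call and its delay are hypothetical placeholders rather than a real API client.

```python
import asyncio


async def fake_api_call(country, delay=0.1):
    """Hypothetical stand-in for a remote API call."""
    # 'await' is the point where this task voluntarily gives up control.
    await asyncio.sleep(delay)
    return country, f"results for {country}"


async def gather_all(countries):
    # Fire every request first, then store the data as it returns.
    tasks = [asyncio.create_task(fake_api_call(c)) for c in countries]
    results = {}
    for finished in asyncio.as_completed(tasks):
        country, payload = await finished
        results[country] = payload
    return results


if __name__ == "__main__":
    print(asyncio.run(gather_all(["Canada", "France", "Japan"])))
```

Results are keyed by country because as_completed yields them in completion order, not in the order the requests were sent.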

So how did the tests turn out? Well, the setup for using the concurrent library was simple to run. While preemptive multitasking is supported, true multiprocessing functions are not supported in Lambda yet. The results of running the standard requests module in a plain loop versus running it with preemptive multitasking were disappointing. A deeper look showed why: we were waiting for the APIs to return information, not lacking processing capacity or speed.

So how to proceed? There must be a faster way to process multiple API requests than a standard for loop. Using the requests_futures library for async requests to the APIs was 3.5-4x faster than preemptive multitasking with the requests module. This method directly addresses the “wait” issue, in much the same way that promises do in JavaScript. Let's dive deeper into async and the requests_futures module.

Let’s Explore Async Python with Lambda and Chalice

Proof, as they say, is in the pudding, but in our case, it will be in the speed at which we can gather information from a public, freely available API. For simplicity, and to allow the majority of folks to try this experiment themselves without a lot of account and API token creation, I have selected a simple, public, free, and tokenless API to use for the exploration.

What are we gathering?

List of Universities in over a dozen countries, results numbering approx. 5000.
- http://universities.hipolabs.com/search?country=COUNTRY

Let’s Code

As a Pythonista I am a huge fan of Chalice (https://aws.github.io/chalice/) as it provides a simple way to create Python-based AWS Lambda APIs. Let’s take a quick look at Chalice then code our gathering of the API data. Looking at the quickstart for Chalice you will see that it is simply a matter of installing Chalice using pip or an equivalent python package manager and that the Chalice commands are available using chalice --help. NOTE: You will need to have configured your AWS credentials and config file prior to being able to deploy your code.

Step One: Create the Chalice project and enter the project directory

chalice new-project restcollector

cd restcollector

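You will also have to add two entries (one per line) to the requirements.txt file so Chalice knows to pull down the libraries we use in our code. Given the modules used in this article, these are the requests and requests-futures packages.

requirements.txt

requests

requests-futures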

Step Two: Edit the app.py with your favorite editor to add our initial non-async attempt as seen in the screenshot below (my inline comments were removed to keep the screenshot smaller).
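A minimal sketch of what such a non-async attempt could look like follows. It is not the code from the screenshot: the function name collect_sequential, the http_get parameter, and the three-country sample list are assumptions made for illustration. Only the endpoint and the two log messages come from this article, and the API is assumed to return a JSON list.

```python
import time
from urllib.parse import urlencode

BASE_URL = "http://universities.hipolabs.com/search"  # endpoint from this article
# Hypothetical sample; the article queried "over a dozen" countries.
COUNTRIES = ["Canada", "France", "Japan"]


def build_url(country):
    # urlencode handles spaces, e.g. "United States" -> "United+States".
    return f"{BASE_URL}?{urlencode({'country': country})}"


def collect_sequential(countries, http_get):
    """Fetch each country's list one after another and time the whole run.

    http_get is any callable like requests.get: it takes a URL and returns
    an object with a .json() method (assumed here to yield a list).
    """
    print("Starting Timer...")
    start = time.time()
    universities = []
    for country in countries:
        response = http_get(build_url(country))  # blocks until the API answers
        universities.extend(response.json())
    elapsed = time.time() - start
    print(f"Total Execution time from invocation: {elapsed}")
    return {"count": len(universities), "seconds": elapsed}
```

In a Chalice project, the function registered with @app.route('/') in app.py would call collect_sequential(COUNTRIES, requests.get) and return the result.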

Step Three: Chalice has a wonderful feature that lets us test locally; you can invoke this with:

chalice local

Then access your function with a browser http://localhost:8000

Your results should look something like this. Times will vary based on your machine's performance and your internet connection; that’s OK, since we are just checking that the code is correct at this point.

Restarting local dev server.

Starting Timer...

Serving on http://127.0.0.1:8000

Total Execution time from invocation: 1.2187955

127.0.0.1 - - [03/Oct/2022 15:15:47] "GET / HTTP/1.1" 200 -

127.0.0.1 - - [03/Oct/2022 15:15:52] "GET /favicon.ico HTTP/1.1" 403 -

Step Four: Deploy your code. Press Ctrl-C to quit the local execution (it may take a moment to clean itself up, so don’t be impatient), then run the chalice deploy command. As long as your AWS credentials are set up properly, you will get a response that looks like the one below, with a URL for you to test your new Lambda (I have removed one piece of data in the ARN and replaced it with an X).

chalice deploy

Creating deployment package.

Creating IAM role: restcollector-dev

Creating lambda function: restcollector-dev

Creating Rest API

Resources deployed:

- Lambda ARN: arn:aws:lambda:us-east-1:X:function:restcollector-dev

- Rest API URL: https://mdukptzumj.execute-api.us-east-1.amazonaws.com/api/

Step Five: Let’s go see how long it took by looking at the Amazon CloudWatch log group that Chalice created for us as part of the deployment. Open a browser and log in to your AWS console, type “CloudWatch” in the “Search for Services, features, blogs, docs, and more” area in the top menu, and press return.

Select Log Groups from the left navigation menu and click on the Log Group that was created by Chalice (it should have the same name you gave the project, restcollector in my case). You will see one or more Log Streams, depending on how many times you refreshed or opened the URL to get results. Click the one with the most recent Last event time. I executed the code twice so that the cold start and the requirements pull would not become a factor in the execution time comparison.

In the log, notice the Starting Timer line from our print statement in the code, followed by the Total Execution time. Make a note of the best time; in this case, it was .2844569683074951 seconds.

Let’s Improve

Great, now we have a working Lambda function powered by the Amazon API Gateway that we deployed with one command (chalice deploy). Now let's refactor our code to use the requests_futures module I mentioned earlier.

Step One: Make a copy of your original file as a reference. In your code editor or on the command line, just copy the file to app.py.bak

Step Two: Enter the code seen below to utilize the requests_futures module. Notice it's very similar code, with a few simple additions to handle the request and return the response. Run chalice local and test the code to make sure it’s all debugged and ready to go (my inline comments were removed to keep the screenshot smaller).
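One way the refactor might look is sketched here. Again, this is not the code from the screenshot: collect_with_futures and its session parameter are hypothetical names, and the session is only assumed to behave like the FuturesSession class from requests_futures, whose get() returns a Future immediately instead of blocking.

```python
import time
from urllib.parse import urlencode

BASE_URL = "http://universities.hipolabs.com/search"  # endpoint from this article


def build_url(country):
    return f"{BASE_URL}?{urlencode({'country': country})}"


def collect_with_futures(countries, session):
    """Queue every request up front, then gather the responses.

    session is expected to behave like requests_futures' FuturesSession:
    session.get(url) returns right away with a Future whose .result()
    yields the response (assumed to carry a JSON list).
    """
    print("Starting Timer...")
    start = time.time()
    # Fire all requests without waiting on any of them.
    pending = [session.get(build_url(c)) for c in countries]
    universities = []
    for future in pending:
        response = future.result()  # waits only if this one is not done yet
        universities.extend(response.json())
    elapsed = time.time() - start
    print(f"Total Execution time from invocation: {elapsed}")
    return {"count": len(universities), "seconds": elapsed}
```

With the library installed, the session would be a FuturesSession imported from requests_futures.sessions. Its max_workers argument controls how many requests run at once; the right value is a tuning choice that this article does not state.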

Step Three: Deploy your code. Press Ctrl-C to quit the local execution of Chalice (it may take a moment to clean itself up, so don’t be impatient), then run the deployment command. As long as your AWS credentials are set up correctly, you will get a response that looks like the one below, with a URL for you to test your new Lambda (I have removed one piece of data in the ARN and replaced it with an X).

chalice deploy

Creating deployment package.

Creating IAM role: restcollector-dev

Creating lambda function: restcollector-dev

Creating Rest API

Resources deployed:

- Lambda ARN: arn:aws:lambda:us-east-1:X:function:restcollector-dev

- Rest API URL: https://2jg5vrlky8.execute-api.us-east-1.amazonaws.com/api/

NOTE: The URL is different than the previous iteration

Step Four: Let’s go see how long it took by looking at the Amazon CloudWatch log group that Chalice created for us as part of the deployment. Open a browser and log in to your AWS console, type CloudWatch in the Search for Services, features, blogs, docs, and more area in the top menu, and press return.

Notice the Starting Timer line from our print statement in the code and the Total Execution time logged after it. Make a note of it; in this case, it was .2309241294860398 seconds, roughly 18.8% less time than the .2844569683074951 seconds of the first version.

Cleaning Up

Great, now we have a working AWS Lambda function powered by Amazon API Gateway that we deployed with one command (chalice deploy), and it runs in roughly 18.8% less time without tweaking anything. Some memory adjustments to the Lambda, to give the preemption a little more working room, could further improve performance; feel free to make the adjustments in the console under Lambda > Configuration (I ran mine with 512MB of memory). When you are done, it’s time to clean up and make sure we don’t leave anything behind. Never the fun part, right?

Step One: Run chalice delete and Chalice will do all the heavy lifting, removing the Lambda, API Gateway, CloudWatch Log group, etc. That wasn’t so bad now, was it? And yep, you are all done.

In Closing

To recap, we created two versions of the code, one traditional and one using the requests_futures module, and were able to create, test, deploy, and remove them with four simple commands.

chalice new-project

chalice local

chalice deploy

chalice delete

Leveraging async libraries is just one of the ways you can get more bang for your AWS Lambda buck by better utilizing the programming language's natural tools to improve performance, sometimes dramatically. Coupled with the ease of Chalice to test, iterate, deploy and redeploy your code in moments, it’s quite a powerful set of tools allowing you to query approx. 5k records from a dozen or more websites and aggregate the data with about 70 lines of code. Hope you enjoyed this little introduction to Chalice and Python’s requests_futures and concurrent libraries. For more information on these libraries you can access the official documentation here:

Concurrent Execution — Python 3.9.14 documentation

ross/requests-futures: Asynchronous Python HTTP Requests for Humans using Futures (github.com)

Visit the GitHub repo to access the code referenced in this article.
