Amazon Kinesis Data Streams On-demand vs. Provisioned Billing Mode Cost Comparison

Is on-demand pricing really ‘serverless’ pricing?

Joel Haubold | Sep 21 2023

When AWS released on-demand billing for Kinesis Data Streams in November 2021 and pitched it as “serverless,” my first thought was: isn’t Kinesis Data Streams more or less already serverless? Yes, provisioned mode charges an hourly rate per shard, but once you use more than about 30% of a shard’s capacity, the per-request cost exceeds the shard-hour cost. In a “true” serverless offering, cost and usage are perfectly correlated: zero usage means zero cost, and what you pay is determined purely by how much you actually use the service. Because the cost of a provisioned stream is driven primarily by per-request charges once shard utilization exceeds 30% (and many real-world workloads will run above 30%), the service’s billing model is already almost serverless. With this in mind, let’s compare the cost of the new serverless billing mode for Kinesis Data Streams against the existing “almost serverless” provisioned mode.
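To see where the 30% figure comes from, here is a minimal sketch of the arithmetic. It assumes the us-east-1 provisioned prices used in this post ($0.015 per shard-hour, $0.014 per million PUT payload units), a per-shard limit of 1,000 records per second, and records small enough to fit in a single 25 KB payload unit. The function names are made up for illustration.

```python
SHARD_HOUR_USD = 0.015        # provisioned price per shard-hour (us-east-1)
PPU_PER_MILLION_USD = 0.014   # price per million PUT payload units
MAX_RECORDS_PER_SEC = 1000    # per-shard write limit in records/second


def request_cost_per_shard_hour(utilization: float) -> float:
    """PUT cost for one shard over one hour at a fraction of the record-rate limit.

    Assumes every record fits in one 25 KB payload unit.
    """
    records = utilization * MAX_RECORDS_PER_SEC * 3600
    return records / 1_000_000 * PPU_PER_MILLION_USD


def crossover_utilization() -> float:
    """Utilization at which the request cost equals the flat shard-hour cost."""
    return SHARD_HOUR_USD / request_cost_per_shard_hour(1.0)


if __name__ == "__main__":
    print(f"crossover at {crossover_utilization():.1%} utilization")
```

Under these assumptions the crossover lands just under 30%, which matches the rule of thumb above. Larger records that span several payload units would reach the crossover at a lower record rate.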

Pricing for provisioned and on-demand streams (us-east-1 prices) differs as follows:

Cost type	Provisioned	On-demand
Metered size (how each record is rounded up)	25 KB	1 KB for writes (none for reads)
Hourly rate	$0.015 per shard-hour	$0.04 per stream-hour
Price per GB inbound (or per metered chunk, since all inbound traffic is rounded up)	$0.014 per million PUT payload units (25 KB chunks); $0.028/GB for 500-byte payloads, $0.00056/GB for 25 KB payloads	$0.08/GB (payloads rounded up to the nearest KB)
Price per GB outbound (or per metered chunk), excluding enhanced fan-out	Free	$0.04/GB (no rounding)
Enhanced fan-out (needed to read more than 2 MB/s per shard; basically, when you have more than two readers on a stream)	$0.013/GB retrieved + $0.015 per consumer-shard-hour	$0.05/GB retrieved
Extended retention (7 days; by default, records are retained for only 24 hours)	$0.02 per shard-hour	$0.10 per GB-month
Scaling behavior	Manual. Adjust the number of shards to 50%-200% of the current count, or split/merge adjacent shards	Automatic. Adjusts capacity to double the peak usage of the past 30 days, with a 15-minute scaling delay
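The per-GB inbound figures in the table follow from the metering sizes. The sketch below reproduces them, assuming decimal units (1 KB = 1,000 bytes, 1 GB = 10^9 bytes), which is what makes the table’s numbers come out exactly. The function names are illustrative only.

```python
import math

PPU_BYTES = 25_000                  # assumed decimal 25 KB PUT payload unit
PPU_PER_MILLION_USD = 0.014         # provisioned price per million payload units
ON_DEMAND_WRITE_PER_GB_USD = 0.08   # on-demand price per metered GB written
KB = 1_000
GB = 1_000_000_000


def provisioned_write_usd_per_gb(payload_bytes: int) -> float:
    """Effective inbound price per GB of real data in provisioned mode."""
    ppus_per_record = math.ceil(payload_bytes / PPU_BYTES)
    records_per_gb = GB / payload_bytes
    return records_per_gb * ppus_per_record / 1_000_000 * PPU_PER_MILLION_USD


def on_demand_write_usd_per_gb(payload_bytes: int) -> float:
    """Effective inbound price per GB of real data in on-demand mode."""
    metered_bytes = math.ceil(payload_bytes / KB) * KB  # rounded up to nearest KB
    return metered_bytes / payload_bytes * ON_DEMAND_WRITE_PER_GB_USD


if __name__ == "__main__":
    for size in (500, 25_000):
        print(size, provisioned_write_usd_per_gb(size), on_demand_write_usd_per_gb(size))
```

The rounding works against small payloads in both modes, but far more so in provisioned mode, where a 500-byte record is billed as if it were 25 KB.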

On-demand mode scales to double your peak write throughput over the previous 30 days. It doesn’t scale based on your reads. If your traffic grows to more than double your previous peak within 15 minutes, you can still get throttled. Hot shards can also still be a problem, since on-demand scaling doesn’t isolate specific hash keys. With provisioned mode, you have to specify the number of shards you want, and you can scale a stream up or down by at most a factor of two at a time. Unfortunately, AWS has no built-in way to automatically scale provisioned mode streams (e.g. Application Auto Scaling doesn’t support Amazon Kinesis Data Streams).
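Because a provisioned stream can only be doubled or halved in one step, reaching a very different shard count takes several resize operations. The sketch below plans those steps. It is a hypothetical helper that only encodes the factor-of-two limit described above, and it makes no AWS calls.

```python
import math
from typing import List


def resharding_steps(current: int, target: int) -> List[int]:
    """Return the shard counts to request, in order, to get from current to target.

    Each step stays within [half, double] of the previous shard count.
    """
    if current < 1 or target < 1:
        raise ValueError("shard counts must be positive")
    steps: List[int] = []
    while current != target:
        if target > current:
            current = min(target, current * 2)             # at most double
        else:
            current = max(target, math.ceil(current / 2))  # at most halve
        steps.append(current)
    return steps


if __name__ == "__main__":
    print(resharding_steps(4, 30))    # scale up in doubling steps
    print(resharding_steps(100, 10))  # scale down in halving steps
```

Each step would have to finish before the next could start, which is part of why large adjustments to a provisioned stream are slow.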


Here are some graphs showing the cost vs. utilization rate for provisioned streams. Each graph also shows the cost of processing the same data in an on-demand stream. All the graphs show the cost of reading the written data once.* Since the payload size affects pricing, there are graphs for 100-byte, 1 KB, 25 KB, and 100 KB payloads. (Provisioned streams meter in 25 KB units while on-demand streams meter writes in 1 KB units, so the payload size affects the price differently in the two modes. Generally, larger payloads favor provisioned streams because of their larger metering size.)

*Reading the data twice would favor provisioned mode, since standard reads are free there while on-demand bills every GB retrieved. Comparing the cost of enhanced fan-out is beyond the scope of this analysis.

The provisioned-mode cost in these graphs is based on 100 shards plus the per-record cost at the utilization rate on the x-axis. The on-demand cost is based on sending the equivalent data through an on-demand stream. The graphs show that cost starts to favor provisioned mode once utilization exceeds about 5%.
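The comparison behind these graphs can be approximated with a small cost model. This is a sketch under stated assumptions, not the exact calculation used to draw the graphs: it takes the us-east-1 prices from the table, decimal units, per-shard limits of 1 MB/s and 1,000 records/s, and data that is written once and read once. All names are illustrative.

```python
import math

SHARD_HOUR_USD = 0.015
PPU_PER_MILLION_USD = 0.014
STREAM_HOUR_USD = 0.04
OD_WRITE_PER_GB_USD = 0.08
OD_READ_PER_GB_USD = 0.04
PPU_BYTES = 25_000              # assumed decimal 25 KB payload unit
KB = 1_000
GB = 1_000_000_000
SHARD_BYTES_PER_SEC = 1_000_000
SHARD_RECORDS_PER_SEC = 1_000


def records_per_hour(shards: int, utilization: float, payload_bytes: int) -> float:
    """Records written per hour when the stream runs at the given utilization."""
    per_shard_rate = min(SHARD_RECORDS_PER_SEC, SHARD_BYTES_PER_SEC / payload_bytes)
    return shards * utilization * per_shard_rate * 3600


def provisioned_hourly_usd(shards: int, utilization: float, payload_bytes: int) -> float:
    """Shard-hours plus PUT payload units; standard reads are free."""
    records = records_per_hour(shards, utilization, payload_bytes)
    ppus = records * math.ceil(payload_bytes / PPU_BYTES)
    return shards * SHARD_HOUR_USD + ppus / 1_000_000 * PPU_PER_MILLION_USD


def on_demand_hourly_usd(shards: int, utilization: float, payload_bytes: int) -> float:
    """Stream-hour plus writes (rounded up to 1 KB) plus one read of the same data."""
    records = records_per_hour(shards, utilization, payload_bytes)
    written_gb = records * math.ceil(payload_bytes / KB) * KB / GB
    read_gb = records * payload_bytes / GB
    return (STREAM_HOUR_USD
            + written_gb * OD_WRITE_PER_GB_USD
            + read_gb * OD_READ_PER_GB_USD)


if __name__ == "__main__":
    for size in (100, 1_000, 25_000, 100_000):
        p = provisioned_hourly_usd(100, 0.05, size)
        o = on_demand_hourly_usd(100, 0.05, size)
        print(f"{size:>7} bytes: provisioned ${p:.3f}/h, on-demand ${o:.3f}/h")
```

Here `shards` is the provisioned shard count, and the on-demand function uses it only to work out how much data the equivalent traffic represents.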

Using the roughly 5% break-even point seen above, we can compare the two billing modes at various request sizes and request rates. The following graphs show the cost of provisioned and on-demand mode at various request rates (requests per second). For provisioned mode, the number of shards is set so that only 5% of the stream’s capacity is used.

The four graphs above show that at 5% utilization, provisioned mode is cheaper for all but the 100-byte payload size. Generally, larger payload sizes favor provisioned mode because of the difference in how payload sizes are rounded: up to the nearest 1 KB for on-demand vs. up to the nearest 25 KB for provisioned.
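Since both cost curves are linear in utilization, the break-even point can also be solved for directly. The sketch below does that per payload size, assuming the same us-east-1 prices, decimal units, a 100-shard stream, and data read once. It is an approximation of the graphs, not a reproduction of them, and the function name is made up.

```python
import math


def break_even_utilization(payload_bytes: int, shards: int = 100) -> float:
    """Utilization at which provisioned and on-demand cost the same per hour."""
    # Records per second one shard accepts at 100% (1,000 records/s or 1 MB/s cap).
    rate = min(1_000, 1_000_000 / payload_bytes)
    records_per_shard_hour = rate * 3600
    # Per-record cost in each mode (USD).
    prov_per_record = math.ceil(payload_bytes / 25_000) * 0.014 / 1e6
    od_per_record = (math.ceil(payload_bytes / 1_000) * 1_000 * 0.08
                     + payload_bytes * 0.04) / 1e9
    # Extra fixed cost provisioned pays per shard-hour versus the shared stream-hour.
    fixed_gap = 0.015 - 0.04 / shards
    return fixed_gap / ((od_per_record - prov_per_record) * records_per_shard_hour)


if __name__ == "__main__":
    for size in (100, 1_000, 25_000, 100_000):
        print(f"{size:>7} bytes: break-even at {break_even_utilization(size):.1%}")
```

Under these assumptions only the 100-byte case breaks even above 5%, which lines up with on-demand winning for that payload size at 5% utilization.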

Let’s consider the following utilization scenarios to see which billing mode would be cheaper:

50% utilization for 8 hours a day and 0% at all other times gives about 17% utilization on average, which the graphs above show would be cheaper with provisioned mode.
50% utilization for 7 hours a week (e.g. 1 hour a day) and 3% utilization at all other times gives about 5% utilization on average. Depending on the payload size, either provisioned or on-demand could be cheaper.
75% utilization for 1 hour a week and 1% utilization at all other times gives about 1.4% utilization on average. For all payload sizes, on-demand would be cheaper in this case.
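Average utilization for schedules like these is just a time-weighted mean of the busy and quiet periods. A minimal sketch, with a hypothetical helper name:

```python
def average_utilization(busy_pct: float, busy_hours: float,
                        idle_pct: float, total_hours: float) -> float:
    """Time-weighted average utilization (in percent) over a repeating window."""
    if not 0 <= busy_hours <= total_hours:
        raise ValueError("busy_hours must be between 0 and total_hours")
    idle_hours = total_hours - busy_hours
    return (busy_pct * busy_hours + idle_pct * idle_hours) / total_hours


if __name__ == "__main__":
    # Busy 8 hours out of every 24, idle otherwise.
    print(round(average_utilization(50, 8, 0, 24), 1))
    # Busy 7 hours out of a 168-hour week, 3% otherwise.
    print(round(average_utilization(50, 7, 3, 168), 1))
    # Busy 1 hour out of a 168-hour week, 1% otherwise.
    print(round(average_utilization(75, 1, 1, 168), 1))
```

Comparing the result with the roughly 5% break-even point gives a quick first read on which billing mode is likely to be cheaper.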

To measure the utilization rate of a stream, take the maximum value of the following two formulas:

(Sum(IncomingBytes) / Period) / (Number of shards * 1000000)
(Sum(IncomingRecords) / Period) / (Number of shards * 1000)

where Period is the number of seconds specified in the CloudWatch GetMetricStatistics API call or in the CloudWatch console. (Each shard supports 1 MB/s or 1,000 records/s, so we calculate the utilization rate for both bytes per second and records per second, and take the maximum to find the limit the stream will hit first.)
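The two formulas translate directly into code. The sketch below takes the CloudWatch sums as plain numbers, so fetching `IncomingBytes` and `IncomingRecords` is left out. The function name and the sample values are made up for illustration.

```python
def shard_utilization(sum_incoming_bytes: float, sum_incoming_records: float,
                      period_seconds: float, shards: int) -> float:
    """Fraction of write capacity used over the period (1.0 means fully utilized)."""
    # Each shard accepts up to 1,000,000 bytes/s and 1,000 records/s.
    bytes_util = (sum_incoming_bytes / period_seconds) / (shards * 1_000_000)
    records_util = (sum_incoming_records / period_seconds) / (shards * 1_000)
    # The larger of the two is the limit the stream hits first.
    return max(bytes_util, records_util)


if __name__ == "__main__":
    # Hypothetical 5-minute window on a 10-shard stream.
    print(shard_utilization(150_000_000, 600_000, 300, 10))
```

In this made-up window the byte rate uses only 5% of capacity while the record rate uses 20%, so many small records are what limit the stream.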

Takeaways

If you need 3 or fewer shards, use provisioned. It’s always cheaper.
On-demand is a good fit if your utilization rate is < 5% and you can’t scale your provisioned shards down to increase your utilization rate.
Provisioned scaling can be hard. It can be slow when you are not scaling up or down by a factor of 2. Scaling is also hard if you use IaC to configure your Kinesis Data Streams. Advanced users with hot/cold shards can enable per-shard metrics and split/merge shards individually, but this increases operational complexity.
Generally, on-demand is better for smaller payload sizes because provisioned meters in 25 KB units, while on-demand meters writes in 1 KB units and doesn’t round reads at all.
Best practice for any cost optimization is to right-size instances before purchasing Reserved Instances (RIs). Similarly, with Kinesis Data Streams, make sure you don’t have too many shards (i.e. that your utilization rate isn’t too low) before switching to on-demand.
