Sunday, September 08, 2019

Tuning DB Parameters for PostgreSQL 12 in Amazon RDS

In my last entry, we saw how to setup PostgreSQL 12 beta 3 in Amazon RDS. In that entry I purposely left out how to change database parameters as I realized that it deserves an entry (or more) by itself.

Using the AWS CLI you can create a new database parameter group as follows:

$ aws rds create-db-parameter-group  --db-parameter-group-name jkpg12pg    \
 --db-parameter-group-family postgres12 --description "My PostgreSQL 12 Parameter Group" \
 --region us-east-2 --endpoint https://rds-preview.us-east-2.amazonaws.com  


We have just created a group and not applied the parameters to any database. Before we apply, we do to see what are the default values  in the created group. You can run a command as follows to see values being set by default in the group.

$ aws rds describe-db-parameters --db-parameter-group-name jkpg12pg  \
--region us-east-2 --endpoint https://rds-preview.us-east-2.amazonaws.com \
--query 'Parameters[].[ParameterName,ParameterValue]' --output text

The output contains a list of parameters with values. Let's look at some of the values to see how to interpret them.


application_name None
..
autovacuum_max_workers GREATEST({DBInstanceClassMemory/64371566592},3)
autovacuum_vacuum_cost_limit GREATEST({log(DBInstanceClassMemory/21474836480)*600},200)
..
effective_cache_size {DBInstanceClassMemory/16384}
..
jit None
..
maintenance_work_mem GREATEST({DBInstanceClassMemory*1024/63963136},65536)
max_connections LEAST({DBInstanceClassMemory/9531392},5000)
..
shared_buffers {DBInstanceClassMemory/32768}
shared_preload_libraries pg_stat_statements
..
work_mem None
xmlbinary None
xmloption None
       

When you see None it basically is equivalent to being not set in postgresql.conf and the default value of the PostgreSQL version engine is used by PostgreSQL. In the above example, you will notice that jit is set to None which means it will take the default  ON value in PostgreSQL 12 and enable jit in the instance.

If you change a parameter set to a specific value based on the supported type for that parameter, then that value (based on the supported unit) will be used instead of the default value in PostgreSQL. For example you see that shared_preload_libraries has a default value of pg_stat_statements. Which means when you deploy a PostgreSQL 12 instance, pg_stat_statements extension will have the library preloaded and available for it without requiring another restart.

Other interesting parameters are ones with values containing {} or with GREATEST or LEAST functions. These values are using macro functions allowing you to set them based on the DBInstanceClassMemory (in bytes) based on the instance size used by the database instance.

For example, shared_buffers is set to {DBInstanceClassMemory/32768} . In PostgreSQL, when there are no units, shared_buffers is set to the number of 8KB  pages based on the value.
So in this example it shows that it is set to  25% or 1/4th of total RAM in bytes in terms of 8192 bytes =   (RAM/8192 )/4 or RAM/32768.


Setting values is an important task to get optimum usage of a PostgreSQL database. Lets look at how I think of setting these values for an instance.

Lets consider a case of an RDS instance type of db.m4.16xlarge with 64GB as RAM. For simplicity, I am not considering basic Linux kernel memory and RDS monitoring and other OS processes but filesystem cache will be considered as that is significant portion of memory used by a PostgreSQL instance. The other major component are shared buffers which is a common shared memory area used by PostgreSQL processes. The final component is the aggregate of all individual private memory of each connection of PostgreSQL database.


TOTAL RAM = Filesystem Cache + Shared DB Buffers Cache + Sum of all  PostgreSQL connections


By default on RDS, the shared buffers are set to 25% of RAM. It would be fair to consider that file system cache usage could be equal or greater than that as all pages will come through file system cache and constitutes 25-30% of RAM. While shared_buffers can be controlled by the shared_buffers parameter, the file system cache cannot be controlled directly, however it can be freed by OS during low memory situations. So using our example of 64GB total memory, we already have accounted for 16GB + 16-20GB of file system cache, leaving about 28-32GB of memory free for your private memory consumed by database connections. In the rest of the calculation we assume this two part to be roughly 50% of RAM

Private memory of database connections are hard to easily measure as they are not 'RSS' of a process but 'RSS' - touched shared memory pages and depends on the number of connections and chunks of work_mem consumed by each connection.

For capacity calculation we could use something simple as

      Average Memory per PostgreSQL connections * Concurrent Connections <= 50% of RAM 

where Average memory per PostgreSQL connection can be simplified to say n * work_mem + process overhead where n can vary by the type of queries. For example a query with JOIN of two tables and ORDER BY can end up using 2 work_mem chunks along with memory process overhead. Putting that into numbers with a default work_mem of 4MB and an approximate process overhead of say roughly 5MB of PostgreSQL (if Linux huge pages are disabled then this number may need to bumped on higher side), each PostgreSQL connection is about 2x4 + 5 = 13MB. If you have 1,000 concurrent MB you may end up consuming about 13GB and for 2,000 connections that number can jump to 26GB. Hence we should make sure that

Work_mem <=   (  50% RAM in KB  /concurrent_connections   -  5,000 KB)/2

Hence query tuning, lowering work_mem, max_connections can help control this component of the memory. If your queries actually ends up requiring more work_mem memory, then your default 25% of shared_buffers needs to be reduced down to make more memory available for your work_mem and max_connection needs. It is perfectly reasonable to take down the percentage of shared_buffers to fit the number of concurrent connections as it is better to have lower hit ratio of buffer pool than ending up using swap space.

The above perspective is a simple way to look at it. I am ignoring other things like temporary tables in memory and other temporary memory that will also consume memory with the assumption queries are simple join and order by queries. But if you are using temporary tables and have large analytical queries, you have to account for that memory in your average memory per PostgreSQL connection to arrive at the usage needed and then maybe reduce shared buffers to make sure the total usage is within total RAM and not end up using swap or causing large flush of the file system cache.

If you want to lower your shared buffers to say 20% instead of the default 25%, you would change the macro for the parameter to be set to (RAM/8192)/5 or {DBInstanceClassMemory/40960}

To override a parameter in a particular group you can do as follows:

$ aws rds modify-db-parameter-group --db-parameter-group-name jkpg12pg \
 --region us-east-2 --endpoint https://rds-preview.us-east-2.amazonaws.com  \
 --parameters "ParameterName=shared_buffers,ParameterValue=\"{DBInstanceClassMemory/40960}\",ApplyMethod=pending-reboot" 



When you list it again you see the change in the parameter group. If you notice that for this parameter, the ApplyMethod is pending-reboot. For static parameters you can only apply them on reboot and shared_buffers is a static parameter. For dynamic parameters you can also use "immediate" as ApplyMethod, which will apply the changes immediately to all database instances using the parameter group.

In our case we have still not applied it to the database yet so it does not matter. Lets first apply it to our database.

$ aws rds modify-db-instance \
 --db-instance-identifier mypg12b3 --db-parameter-group jkpg12pg \
 --region us-east-2 --endpoint  https://rds-preview.us-east-2.amazonaws.com 


Note however changing the group does not mean all the parameters are in effect. Since static changes can only be applied after a reboot, we will reboot the instance as follows:

$ aws rds reboot-db-instance  --db-instance-identifier mypg12b3  \
--region us-east-2 --endpoint  https://rds-preview.us-east-2.amazonaws.com  

With the reboot now we have a database running with a custom parameter group which has parameters sets tuned based on the expected workflow. You may not get them right in the first try but now you know how to set them and apply them to the database using the CLI commands.



This blog represents my own view points and not of my employer, Amazon Web Services.


52 comments:

Bhanu Sree said...

Informative Blog
AWS Training
AWS certification training

IICT said...

Rds concepts on aws are explained in this blog perfectly.
AWS Training in Chennai

svrtechnologies said...

Pretty article! I found some useful information in your blog, it was awesome to read, thanks for sharing this great content to my vision, keep sharing. devops training in bangalore

Bhanu Sree said...

Interesting blog, here a lot of valuable information is available, it is very useful information.
RedShift Training
AWS RedShift Training
Amazon RedShift Online Training

Raj Sharma said...

Nice Post. I like your blog. Thanks for Sharing.
AWS Course in Noida

manisha said...

Thanks for sharing your valuable information and time. 
AWS certifications training in Delhi
AWS Training in Delhi
AWS Training institute in Delhi

Unknown said...

Poker online situs terbaik yang kini dapat dimainkan seperti Bandar Poker yang menyediakan beberapa situs lainnya seperti http://62.171.128.49/hondaqq/ , kemudian http://62.171.128.49/gesitqq/, http://62.171.128.49/gelangqq/, dan http://62.171.128.49/seniqq. yang paling akhir yaitu http://62.171.128.49/pokerwalet/. Jangan lupa mendaftar di panenqq silakan dicoba ya boss

IICT said...

Great article with lots of useful information on AWS Training in Chennai wonderful explanation on AWS Courses.
AWS Training Center in Chennai | Best AWS Training in Chennai | AWS Training Institute in Chennai

keerthana said...

great and nice for the article.
PHP Training in Chennai | Certification | Online Training Course | Machine Learning Training in Chennai | Certification | Online Training Course | iOT Training in Chennai | Certification | Online Training Course | Blockchain Training in Chennai | Certification | Online Training Course | Open Stack Training in Chennai |
Certification | Online Training Course

Alfred Avina said...

It is an effective method to understand the data transformation services or to know more about data warehouses .

dhinesh said...

An overwhelming web journal I visit this blog, it's unfathomably amazing. Unusually, in this present blog's substance made inspiration driving truth and reasonable. The substance of data is enlightening.


Full Stack Course Chennai
Full Stack Training in Bangalore

Full Stack Course in Bangalore

Full Stack Training in Hyderabad

Full Stack Course in Hyderabad

Full Stack Training

Full Stack Course

Full Stack Online Training

Full Stack Online Course



pavi said...

Am really impressed about this blog because this blog is very easy to learn and understand clearly.This blog is very useful for the college students and researchers to take a good notes in good manner,I gained many unknown information.



Data Science Training In Chennai

Data Science Online Training In Chennai

Data Science Training In Bangalore

Data Science Training In Hyderabad

Data Science Training In Coimbatore

Data Science Training

Data Science Online Training

kirankumarpaita said...

software testing company in India
software testing company in Hyderabad
Thank you so much for sharing your brilliant thoughts with us.
Nice information.
please keep sharing.

Huongkv said...

Aivvu chuyên vé máy bay, tham khảo

vé máy bay đi Mỹ hạng thương gia

ve may bay tet 2021

đặt vé máy bay đi toronto canada

vé máy bay đi Pháp

giá vé máy bay từ việt nam sang Anh

trang web vé máy bay giá rẻ

combo đà nẵng 5 ngày 4 đêm

combo nghỉ dưỡng nha trang

Verify Customer Identity said...

The actual demand for id verification service is higher these days. Trust Swiftly is the only platform that has distinct verification approaches, and an organization can bring superior safety to their organization by implementing all of the verification systems. When online searchers use this trustswiftly site, they grab information about id verification service.

Pradeep Appslure said...

great this information
app development company in delhi

Emma King said...


Hii,
This is Great Post!!
Thanks for sharing with us!! this is Really useful for me.. Please Keep here some updates.Website Development Services in AU

i347 Studio said...

i347 Studio is the premier provider of a website designing company in Delhi, website designing is a way to refers to the design of websites that are displayed on the internet.

Unknown said...

SAP ERP Online Training
SAP ERP Training in Bangalore
SAP ERP Training in Chennai

SAP GRC Online Training
SAP GRC Training in Bangalore
SAP GRC Training in Chennai

SAP QM Online Training
SAP QM Training in Bangalore
SAP QM Training in Chennai

Gyanas said...

It is Very nice post. I just stumbled upon your weblog and wished to say that I have truly enjoyed surfing around your blog posts. Web Designing Services in Gurgaon

CSS Founder said...

As we all know that website is the most important asset nowadays to increase your business. Connect with the best Website designing company in Delhi to get world-class website design for your business and bring your offline business online.

Anonymous said...

Are you planning to market your business online and looking for a digital marketing company in Delhi? Visit Techdost.com and grab the opportunity to get a discount on Digital Marketing Services in Meerut.
We are an organised, trustworthy and reliable digital marketing agency providing high-quality and cost-effective digital marketing services in Delhi and all over India.
Digital Marketing Company in Delhi
Social Media Marketing Expert in Meerut
Android App Developer in Meerut

Prince said...

You can get the best iPhone Insurance by connecting with your mobile insurance. We are associated with 25+ countries and we provide fast & reliable service of product repairing or replacement

Prince said...

Thanks for sharing this nice information by posting this blog. Guys, you can connect with us if you are looking for the best tent rental service provider in Dubai, United Arab Emirates. Connect with us to get the best tent and make your party and event memorable.
Car parking shades Dubai

Prince said...

You can hire the best SEO Company in Ghaziabad that is PromoteDial for growing your online business. If you are not able to get more traffic and customers towards your website then we can be the right choice for you.

Unknown said...

Really your post was so nice and energetic I like your post regularly and i read your all post regularlly.Daily UPSC Prelims Quiz

TechDost Services said...

DaiyaMondo is an e-commerce company based in India.

We deal in Electronics, Personal Care, Lifestyle, etc.

We prefer quality over quantity products because it’s our responsibility to provide the best value to our customers.

We are committed to customer satisfaction by understanding their needs or requirements to deliver suitable and best products. We are always ready to improve ourselves if there is any issue “Please don't forget to inform us” We love to hear your problem! We will try our best to resolve it as soon as possible.

Steadfast services said...

Connect with Steadfast Services if you are interested to get work permit, travel visa, work visa etc in Dubai, Poland, Malta and Europe. We can be the right choice for you.
Immigration Services in Dubai

Exotica housing said...

If you looking for a Housing and flat so Exotica Housing will be the best quality product for you so go with exotica housing
Best Developer in Noida

Akash Kanaujiya said...

Do you want to build a website? Whose goal is to elevate your digital presence? Are you a business owner, entrepreneur or start-up, do you want to build a website specifically to promote your product, brand or idea? Then CSS Founder will help you, contact us for more information
Website design company Bern

CSS Founder said...

Are you finding the best Web design company in pune? If yes then connect with CSS Founder that can provide you attractive website for your online business.

Akash Kanaujiya said...

Because of the rapidly increasing web development and web designing, it is important for any company to have a website of its own. We should have our own website so that we can reach good information about our company CSS Founder is the best website design company, contact us to connect with CSS Founder
Web design company Tronto

cyber pc said...

that is in intention of reality exceptional to answer..informative pronounce is completely exceptional to entre..thank you loads! Virtual Audio Cable Crack is a powerful multimedia driver containing input, output designed to connect one or more audio applications. Virtual Audio Cable

cyber pc said...

We current you a extensive selection of good night messages for him that you simply may send. Good Night Wishes For best Friends words and phrases will gladden these people https://wishesquotz.com/good-night-wishes-for-friends/

Natures Pired said...

The team at 36RPM have combined experience of 12 years in developing marketing strategies and developing E-commerce websites & plans.

Seo Services in Gurgaon

thorndalemedicalclinic said...

It is a very informative and useful post. Thanks a lot. GP in Dublin

aquapro said...

Pure Water is the World’s First and Foremost Medicine. Ultraviolet 36 (GPM-HTM) Water Sterilizer

CSS Founder said...

We are a website designing company in Ghaziabad called CSS Founder. We can be the right choice for you.

accepthealthcare said...

I truly love how it is easy on my eyes and the data are well written. I am wondering. Patient Care Services in Jewar

Sakshi Singh said...

Thanks for sharing great articles, I appreciate your efforts. Visit Website Design Company In Delhi

Appconsultio said...

iOS App Development Company in Mumbai

Appconsultio said...

Flutter App Development Company in Pune

Pagadex739 said...

Thanks for sharing valuable info
The psychology of packaging and consumer behavior

CareTaker said...

PGWare GameGain Crack instantly improves the performance of your computer and compresses all the performance that computer gaming hardware can provide. Given that many of the latest PC games require high hardware requirements to have a proper gaming experience, it is important to optimize the performance of your computer.

Anonymous said...

Great post, thanks for sharing valuable information, keep posting Salesforce Course In Pune

breekelly said...

However, it is important to note that webtechmantra operates in a legal gray area. The site provides access to copyrighted material without the permission of the copyright owners. Streaming or downloading copyrighted material without permission is illegal, and users could face legal consequences for doing so.

Mr gacor said...

slot sultanking is a good place to play game and get alot of benefit

Kumpulan situs game slot online deposit ewallet said...

Sultanking merupakan salah satu situs slot gacor yang mempunyai beragam game yang terlengkap dan terpercaya di Indonesia.

informasicpns said...

You must check Sultanking

iteducationcentre said...

very informative and useful Post.
SQL Classes in Pune

SHIVAM SHARMA said...

informative blog, keep posting java classes in pune

Vishal Tomar said...

Regarding website designing, CSS Founder is the best website design company in Hyderabad. With their expertise and dedication, they deliver a seamless website experience that is tailored to meet your specific needs. Their team of professionals ensures that your website not only looks attractive but also functions smoothly.