Shaving of Microseconds


Shaving of Microseconds
=tg= Thomas Grohser

select * from =tg= where topic = @@Version

SQL 4.21        First SQL Server ever used (1994)
SQL 6.0         First Log Shipping with failover
SQL 6.5         First SQL Server Cluster (NT4.0 + Wolfpack)
SQL 7.0         2+ billion rows / month in a single table
SQL 2000        938 days with 100% availability
SQL 2000 IA64   First SQL Server on Itanium IA64
SQL 2005 IA64   First OLTP long distance database mirroring
SQL 2008 IA64   First replication into mirrored databases
SQL 2008R2 IA64 / x64   First 256 CPUs & >500,000 stmt/sec; first scale-out >1,000,000 stmt/sec; first time 1.2+ trillion rows in a table
SQL 2012        >220,000 transactions per second; >1.3 trillion rows in a table
SQL 2014        >400,000 transactions per second; fully automated deploy and management
SQL 2016        AlwaysOn automatic HA and DR; crossed the PB in storage
SQL 2017        In production 3 minutes after RTM; HA for replication
SQL vNext       Can't wait to push the limits even further

=tg= Thomas Grohser, NTT DATA UK
Senior Director Technical Solutions Architecture
email: tg@grohser.com
Focus on SQL Server security, performance engineering, infrastructure and architecture
Wrote some of
Close relationship with SQLCAT (SQL Server Customer Advisory Team), SCAN (SQL Server Customer Advisory Network), TAP (Technology Adoption Program), and the product teams in Redmond
Active PASS member and PASS Summit speaker
24 years with SQL Server

Agenda

Agenda
What this talk is about and what it is not!
Shaving of microseconds
Q&A

What this talk is NOT about

This talk is NOT about:
Query tuning
Indexing strategies
Schema design
Hardware design
In-Memory OLTP
How to make a 10 minute query run in 1 minute

This talk is about how to make a transaction that took 2 ms run in 1 ms, using very unusual tricks and some very obvious ones too…

WARNING

WARNING – WARNING – WARNING
Most of this presentation is extreme and should NOT be taken as best practice. In most cases the techniques here will make little difference but introduce a lot of work. This is for the cases where that extra work is required.

Why is this needed?

Why is this sometimes needed?
If you have a lot of tiny transactions, every microsecond counts.
We have an auditing system that inserts 6 billion rows per day: 6,000,000,000 rows, and a day has only 86,400,000 ms.
2 ms per insert → 139 cores
1 ms per insert → 70 cores, or about half a million less in licenses
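The core counts above follow directly from the row rate; a quick sanity check of the arithmetic (assuming inserts are fully serialized per core):

$$6\times10^{9}\ \text{rows/day} \times 2\,\text{ms} = 1.2\times10^{10}\,\text{ms} = 1.2\times10^{7}\,\text{s of CPU time per day}$$

$$\frac{1.2\times10^{7}\,\text{s}}{86{,}400\,\text{s/core-day}} \approx 139\ \text{cores}, \qquad \frac{6\times10^{6}\,\text{s}}{86{,}400\,\text{s/core-day}} \approx 69.4 \approx 70\ \text{cores}$$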

Let’s go

This is what we are going to make faster

CREATE TABLE FastInserts (
    ID int PRIMARY KEY CLUSTERED IDENTITY(1,1),
    RecordInsertTime datetime DEFAULT (GETDATE()) NULL,
    AValueToBeStored decimal(38,2) NULL,
    AFlagToBeStored int NULL,
    ATextToBeStored nvarchar(30) NULL,
    ANumberToBeStored bigint NULL
);

INSERT INTO FastInserts (ANumberToBeStored, AValueToBeStored, AFlagToBeStored, ATextToBeStored)
VALUES (123, 123.45, 1, '1234567890123456');

Cleanup Column Names

CREATE TABLE FastInserts (
    ID int PRIMARY KEY CLUSTERED IDENTITY(1,1),
    RecordInsertTime datetime DEFAULT (GETDATE()) NULL,
    Amount decimal(38,2) NULL,
    Flag int NULL,
    CreditCardNr nvarchar(30) NULL,
    UserAccountID bigint NULL
);

INSERT INTO FastInserts (UserAccountID, Amount, Flag, CreditCardNr)
VALUES (123, 123.45, 1, '1234567890123456');

Understanding what data is stored is key to making it faster.

The Obvious, Cleanup Data Types

CREATE TABLE FastInserts (
    ID bigint PRIMARY KEY CLUSTERED IDENTITY(1,1),
    RecordInsertTime datetime2(7) DEFAULT (GETDATE()) NULL,
    Amount money NULL,
    Flag bit NULL,
    CreditCardNr varchar(16) NULL,
    UserAccountID int NULL
);

INSERT INTO FastInserts (UserAccountID, Amount, Flag, CreditCardNr, RecordInsertTime)
VALUES (123, 123.45, 1, '1234567890123456', '2018-01-19 12:30:00');

The Obvious, Cleanup Data Types

ID               bigint        +4
RecordInsertTime datetime2(7)   0
Amount           money         -9
Flag             bit           -3
CreditCardNr     varchar(16)  -16
UserAccountID    int           -4

Total: 28 bytes less on an 82-byte row. That is up to 156 GB of data, or 33% less, per day. With 1 ms per IO, 256 KB checkpoint IOs and 60 KB log writes, that is about 0.56 microseconds per row saved just on the IO, or about 1 hour faster per day.
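The savings above check out (assuming each row is written once at checkpoint and once to the log):

$$28\ \text{B} \times 6\times10^{9}\ \text{rows} = 1.68\times10^{11}\ \text{B} \approx 156\ \text{GiB per day}$$

$$\underbrace{\frac{1\,\text{ms}}{256\,\text{KB}}\times 28\,\text{B}}_{\approx\,0.11\,\mu\text{s (checkpoint)}} \;+\; \underbrace{\frac{1\,\text{ms}}{60\,\text{KB}}\times 28\,\text{B}}_{\approx\,0.46\,\mu\text{s (log)}} \;\approx\; 0.56\,\mu\text{s per row}$$

$$0.56\,\mu\text{s} \times 6\times10^{9}\ \text{rows} \approx 3{,}360\ \text{s} \approx 1\ \text{hour per day}$$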

One more byte: NOT NULL instead of NULL

CREATE TABLE FastInserts (
    ID bigint PRIMARY KEY CLUSTERED IDENTITY(1,1),
    RecordInsertTime datetime2(7) DEFAULT (GETDATE()) NOT NULL,
    Amount money NOT NULL,
    Flag bit NOT NULL,
    CreditCardNr varchar(16) NOT NULL,
    UserAccountID int NOT NULL
);

INSERT INTO FastInserts (UserAccountID, Amount, Flag, CreditCardNr)
VALUES (123, 123.45, 1, '1234567890123456');

Four more bytes: varchar / char

CREATE TABLE FastInserts (
    ID bigint PRIMARY KEY CLUSTERED IDENTITY(1,1),
    RecordInsertTime datetime2(7) DEFAULT (GETDATE()) NOT NULL,
    Amount money NOT NULL,
    Flag bit NOT NULL,
    CreditCardNr char(16) NOT NULL,
    UserAccountID int NOT NULL
);

INSERT INTO FastInserts (UserAccountID, Amount, Flag, CreditCardNr)
VALUES (123, 123.45, 1, '1234567890123456');

A varchar costs 4 bytes of metadata plus the data: 4 + 16 = 20 bytes. With 16 characters being by far the most common length, char(16) saves 4 bytes.

Prefix Object References with the Schema Name

If not, every time the query executes SQL Server must look up the default schema of the user, and if none is specified, the default schema of the database. These are two lookups on a very busy memory location, requiring at least two latches while everyone else waits on a spinlock. There is a reason In-Memory OLTP requires schema prefixes on all objects.

Prefix Object Names Everywhere

CREATE TABLE dbo.FastInserts (
    ID bigint PRIMARY KEY CLUSTERED IDENTITY(1,1),
    RecordInsertTime datetime2(7) DEFAULT (GETDATE()) NOT NULL,
    Amount money NOT NULL,
    Flag bit NOT NULL,
    CreditCardNr char(16) NOT NULL,
    UserAccountID int NOT NULL
);

INSERT INTO dbo.FastInserts (UserAccountID, Amount, Flag, CreditCardNr)
VALUES (123, 123.45, 1, '1234567890123456');

This should be best practice everywhere.

Make the Primary Key Random

CREATE TABLE dbo.FastInserts (
    ID uniqueidentifier PRIMARY KEY CLUSTERED,
    UserAccountID int NOT NULL,
    RecordInsertTime datetime2(7) DEFAULT (GETDATE()) NOT NULL,
    Amount money NOT NULL,
    Flag bit NOT NULL,
    CreditCardNr char(16) NOT NULL,
    Pad char(19) NOT NULL
);

INSERT INTO dbo.FastInserts (ID, UserAccountID, Amount, Flag, CreditCardNr)
VALUES (NEWID(), 123, 123.45, 1, '1234567890123456');

Make the Primary Key Random

With an ever-increasing key, all inserts go to the end of the table, creating a hotspot on the last page; the more concurrent sessions, the worse it gets. With an ever-increasing key at 300 concurrent sessions we max out at about 6,000 inserts/second. Random inserts go to over 180,000 inserts/second on the same hardware.

Make the Primary Key Random

Make the row 32, 64, 128 bytes long

CREATE TABLE dbo.FastInserts (
    ID uniqueidentifier PRIMARY KEY CLUSTERED,
    UserAccountID int NOT NULL,
    RecordInsertTime datetime2(7) DEFAULT (GETDATE()) NOT NULL,
    Amount money NOT NULL,
    Flag bit NOT NULL,
    CreditCardNr char(16) NOT NULL,
    Pad char(7) NOT NULL
);

INSERT INTO dbo.FastInserts (ID, UserAccountID, Amount, Flag, CreditCardNr)
VALUES (NEWID(), 123, 123.45, 1, '1234567890123456');

This only helps later with reads: it places rows neatly in cache lines.

Faster Queries: place the SARGs at the front

CREATE TABLE dbo.FastInserts (
    ID bigint PRIMARY KEY CLUSTERED IDENTITY(1,1),
    UserAccountID int NOT NULL,
    RecordInsertTime datetime2(7) DEFAULT (GETDATE()) NOT NULL,
    Amount money NOT NULL,
    Flag bit NOT NULL,
    CreditCardNr char(16) NOT NULL
);

SELECT … FROM dbo.FastInserts WHERE UserAccountID = 123;

If all SARGs (search arguments) are in the first 64 bytes of the row, they are all fetched with a single cache line.

Use BIN2 Collations

All comparisons become much faster: String1 = String2 is a plain memory compare instead of are_these_equal(String1, String2). But your T-SQL is now case sensitive if you do it at the server level; consider doing it just at the database or even just the column level.
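A minimal sketch of column-level BIN2 collation (table and column names are illustrative, not from the talk):

```sql
-- Collate only the hot column; the database keeps its default collation,
-- so ordinary T-SQL elsewhere stays case-insensitive.
CREATE TABLE dbo.FastLookups (
    ID bigint PRIMARY KEY CLUSTERED IDENTITY(1,1),
    Token char(16) COLLATE Latin1_General_100_BIN2 NOT NULL
);

-- This equality check is now a straight byte-by-byte comparison
-- (and therefore case sensitive for this column only).
SELECT ID FROM dbo.FastLookups WHERE Token = 'ABCDEFGH12345678';
```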

CPU Affinity / IO Affinity

NUMA

Soft NUMA: each IP/port pair is mapped to a core (with its cache), which owns a range of the data.
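One documented way to wire up the port-per-node mapping (the port numbers and masks here are illustrative): in SQL Server Configuration Manager, a TCP port can be bound to a NUMA-node affinity mask by appending the mask in brackets, so connections arriving on a given port are handled by a specific node:

```sql
-- Illustrative TCP/IP port settings (Configuration Manager, TCP/IP properties):
--   1433[0x1]  connections on port 1433 are handled by node 0
--   1434[0x2]  connections on port 1434 are handled by node 1

-- Soft-NUMA itself is on by default since SQL Server 2016 and can be toggled:
ALTER SERVER CONFIGURATION SET SOFTNUMA ON;
```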

Statement, Procedure, Prepared Statement

-- Ad-hoc statement:
INSERT INTO dbo.FastInserts (ID, UserAccountID, Amount, Flag, CreditCardNr)
VALUES (NEWID(), 123, 123.45, 1, '1234567890123456');

-- Stored procedure:
EXEC dbo.InsertRow @P1 = 123, @P2 = 123.45, @P3 = 1, @P4 = '1234567890123456';

-- Prepared statement:
stmt_id = PrepareStatement(
    INSERT INTO dbo.FastInserts (ID, UserAccountID, Amount, Flag, CreditCardNr)
    VALUES (NEWID(), @P1, @P2, @P3, @P4);
)
Execute stmt_id, @P1 = 123, @P2 = 123.45, @P3 = 1, @P4 = '1234567890123456'
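Under the hood, client drivers implement the prepared path with the sp_prepare / sp_execute system procedures; a T-SQL sketch of roughly what a driver sends (handles are normally managed by the driver, and the table is the one defined earlier):

```sql
DECLARE @handle int;

-- Prepare once: the statement is compiled and the plan cached under the handle.
EXEC sp_prepare @handle OUTPUT,
    N'@P1 int, @P2 money, @P3 bit, @P4 char(16)',
    N'INSERT INTO dbo.FastInserts (ID, UserAccountID, Amount, Flag, CreditCardNr)
      VALUES (NEWID(), @P1, @P2, @P3, @P4);';

-- Execute many times: only the handle and parameter values cross the wire.
EXEC sp_execute @handle, 123, 123.45, 1, '1234567890123456';
EXEC sp_execute @handle, 124, 264.85, 0, '2648608692937594';

EXEC sp_unprepare @handle;
```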

Payload BINARY

CREATE TABLE dbo.FastInserts (
    ID uniqueidentifier PRIMARY KEY CLUSTERED,
    UserAccountID int NOT NULL,
    Data BINARY(48) NOT NULL
);

INSERT INTO dbo.FastInserts (ID, UserAccountID, Data)
VALUES (NEWID(), 123, 0x4976961496149619846…);

This is getting very extreme… good luck reporting on this one.

Shorten Names

CREATE TABLE F.I (
    I uniqueidentifier PRIMARY KEY CLUSTERED,
    U int NOT NULL,
    D BINARY(48) NOT NULL
);

INSERT INTO F.I (I, U, D)
VALUES (NEWID(), 123, 0x4976961496149619846…);

Batching Inserts / Using Bulk

INSERT INTO dbo.FastInserts (ID, UserAccountID, Amount, Flag, CreditCardNr)
VALUES (NEWID(), 123, 173.95, 1, '1739567890128406'),
       (NEWID(), 124, 264.85, 0, '2648608692937594'),
       (NEWID(), 125, 355.75, 1, '3557799794746687'),
       (NEWID(), 126, 446.65, 1, '4466887898555771'),
       (NEWID(), 127, 537.55, 0, '5375979990364869'),
       (NEWID(), 128, 628.45, 1, '6284567892173952');

Or even better, use the bulk insert API.

Hardware

High clock speed: disable cores to get faster cores
Lots of cache
Fast memory
NV memory for the tail of the log
NVMe disks, direct attached, for data and log
Intelligent fast NIC (TCP offloading)

Thank You Q&A tg@grohser.com