SQL SERVER SSIS SQL Server Integration Services Know

  • Slides: 30
Download presentation
SQL SERVER - SSIS SQL Server Integration Services Know your data source well

SQL SERVER - SSIS SQL Server Integration Services Know your data source well

Who am I? �Nik – Shahriar Nikkhah �Microsoft MVP 2010 – SQL Server �MCITP

Who am I? �Nik – Shahriar Nikkhah �Microsoft MVP 2010 – SQL Server �MCITP SQL 2008 �MCTS SQL 2008 and 2005 Emails: SNikkhah@Live. ca SNikkhah@Yahoo. com msdn. microsoft. com (SSIS forum) One chapter on SSIS in MVP Deep dive 2 (Sep 2011)

OVERVIEW �Know your data source well / Data cleansing 1. Chronological file order 2.

OVERVIEW �Know your data source well / Data cleansing 1. Chronological file order 2. Data cleansing 3. Check a few sample packages �Error handling / Email notification 1. Capture error in a text file 2. Email error file as notification 3. One package sample � A package with the combination of the above.

Know your data source well �Analyze you data source from 2 different angles 1

Know your data source well �Analyze you data source from 2 different angles 1 - Data point of view • Data relations , field mapping, data value • PK, FK, Index, Metadata, Dictionary (mapping) tables • Good records and bad records (Redirecting) 2 - Data source behavior • Behavior changes (Table / file renaming and header names changes ) • Delivery process, how does the source get made, provided and loaded. (CSV been open by excel and saved) • Who is providing it.

Scenario on data behavior Data Point of view

Scenario on data behavior Data Point of view

Scenario on data behavior Data Point of view

Scenario on data behavior Data Point of view

Scenario on data behavior Data source behavior

Scenario on data behavior Data source behavior

Scenario on data behavior Data source behavior

Scenario on data behavior Data source behavior

Scenario on data behavior Data source behavior

Scenario on data behavior Data source behavior

Scenario on data behavior Data source behavior Who is providing data source Files renamed

Scenario on data behavior Data source behavior Who is providing data source Files renamed and moved to different folders.

Daily file load statistics Perfect world Working No. of CVS / Excel, Excel Records

Daily file load statistics Perfect world Working No. of CVS / Excel, Excel Records Total no. days Packages Load & Sheets per sheet Records, Reload (1, 000) Million 21 100 1 -1 1 -3 1 K , 10 K 2. 1 - 63 Million record per day 0. 1 - 3

Daily file load statistics Real world Working No. of CVS / Excel, Excel Records

Daily file load statistics Real world Working No. of CVS / Excel, Excel Records Total no. days Packages Load & Sheets per sheet Records, Reload (1, 000) Million 21 100 3 -5 1 -3 1 K , 10 K 6. 3 - 315 Million record per day 0. 3 - 15 Loads Reloads Files loaded per month Monthly extra reload (Population reload) 6, 300 – 1 o, 500 files / month 2 – 3 reload a month = 12. 6 – 31. 5 files / month Forecast Packages for the next year New customers Extra 200 (sum of 300 per customer) 2 – 3 per year

Chronological file load � Over 99% of the ETLs that have a file as

Chronological file load � Over 99% of the ETLs that have a file as a source don’t use chronological file load in the SSIS package.

Chronological file load � Package overview.

Chronological file load � Package overview.

Chronological file load � Script that provides the files properties and information

Chronological file load � Script that provides the files properties and information

Chronological file load � Inside the DFT

Chronological file load � Inside the DFT

Chronological file load � Sort object

Chronological file load � Sort object

Chronological file load � Set flag

Chronological file load � Set flag

Chronological file load � Second For Each. Loop Display script

Chronological file load � Second For Each. Loop Display script

Data cleansing � Data cleansing and transformation � Data flow transformation includes a series

Data cleansing � Data cleansing and transformation � Data flow transformation includes a series of data cleansing tool such as � Joins � Fuzzy Lookups � Character mapping � Data type conversion � Derived columns � Set of Boolean functions for data comparisons and replacement

Data cleansing

Data cleansing

Data cleansing

Data cleansing

Data cleansing

Data cleansing

Data cleansing

Data cleansing

Error handling / Email notification �Keep track of your packages when an error occurs

Error handling / Email notification �Keep track of your packages when an error occurs �Organize your error files �Backup in the right folder �Display the right Error message. �Send a notification message to the right person �The subject of the email must be clear

Capture error files in a text file

Capture error files in a text file

Capture error files in a text file

Capture error files in a text file

Capture error files in a text file SEE ATTACHED SAMPLE

Capture error files in a text file SEE ATTACHED SAMPLE

Email notification Use SSIS Variables to set your SMTP object SEE ATTACHED SAMPLE

Email notification Use SSIS Variables to set your SMTP object SEE ATTACHED SAMPLE

THANK YOU

THANK YOU