How to create a simple ETL/stored procedure error capturing database (schema model, SQL code and implementation) – Part 1

November 5th, 2014 / 2 Comments » / by admin

Introduction

This is a two-part series (the second part can be found HERE) on how to create and implement an ETL execution error-capturing database (schema model, SQL code, implementation etc.) based on Microsoft SQL Server stored procedures' default error-handling capabilities.

TL;DR Version

My last client engagement required me to create an ETL error tracking solution which would serve as a central repository for all data related to SSIS package execution failures. Given that the initial scope and assumptions pointed to a small project, with the number of ETL routines involved kept at a minimum, my first intention was to build a single table encompassing all key information related to the package metadata and execution state. However, as the project grew, execs started to demand richer, more detailed data and the constricting schema of a single table became too evident, so I expanded it into a proper relational database schema with additional metadata attributes at all common data-grouping hierarchies i.e. instance, database, schema and objects. In addition, I also extracted metadata pertaining to users, logins and packages, which complemented the whole data set with a comprehensive suite of variables for error-related analysis. Please note that this model and the subsequently discussed implementation are based primarily on capturing errors resulting from stored procedure execution, as per my project demands. Since version 2012, SQL Server has included the SSISDB catalog as the central point for working with Integration Services (SSIS) projects deployed to the Integration Services server; it contains a rich set of data for all execution eventualities and should be treated as the default logging option for comprehensive SSIS runtime event capture.

Below is a breakdown of the database schema model, which accounts for most of the database-related objects as well as some handy information on individual packages, logins and users. First up, we have the instance- and database-specific tables – LogSSISErrors_Instance and LogSSISErrors_DB – in a one-to-many relationship i.e. one SQL Server instance to multiple databases.

Error_Capture_DB_Schema_ERD_Partial_1

As each database can have multiple schemas, LogSSISErrors_DB has a one-to-many relationship to the LogSSISErrors_Schema table, which in turn relates to the LogSSISErrors_Object and LogSSISErrors_Process tables, each storing object-level data. The only distinction between the 'Object' and 'Process' tables is that the 'Object' table stores table and view metadata, whereas the 'Process' table is concerned with stored procedures, assemblies, functions and triggers, even though on the database level they are all technically treated as objects.

Error_Capture_DB_Schema_ERD_Partial_2

Both LogSSISErrors_Object and LogSSISErrors_Process have a one-to-many relationship with the LogSSISErrors_Error table – the centrepiece of all the logging activities. The LogSSISErrors_Error table is also linked to the LogSSISErrors_User, LogSSISErrors_Package and LogSSISErrors_Login tables, storing user, local SSIS package and login metadata.

Error_Capture_DB_Schema_ERD_Partial_3

The LogSSISErrors_User table is also connected to the LogSSISErrors_DB table, as one database can have multiple users created against it.

Error_Capture_DB_Schema_ERD_Partial_4

The full entity relationship diagram for AdminDBA is as per the image below.

Error_Capture_DB_Schema_ERD_Full
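To make the relationships above a little more tangible, below is a minimal DDL sketch of the central LogSSISErrors_Error table and its foreign keys. The column names here are purely illustrative assumptions – the actual table definitions ship with the full script available from the OneDrive link further down.

-- Illustrative sketch only; column names are assumed, the real DDL is in the downloadable script
CREATE TABLE LogSSISErrors_Error
    (
      ID INT IDENTITY(1, 1) NOT NULL PRIMARY KEY ,
      ErrorNumber INT ,
      ErrorMessage NVARCHAR(4000) ,
      ErrorDateTime DATETIME ,
      FKObjectID INT NULL FOREIGN KEY REFERENCES LogSSISErrors_Object ( ID ) ,
      FKProcessID INT NULL FOREIGN KEY REFERENCES LogSSISErrors_Process ( ID ) ,
      FKUserID INT NULL FOREIGN KEY REFERENCES LogSSISErrors_User ( ID ) ,
      FKPackageID INT NULL FOREIGN KEY REFERENCES LogSSISErrors_Package ( ID ) ,
      FKLoginID INT NULL FOREIGN KEY REFERENCES LogSSISErrors_Login ( ID )
    );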

All the data represented by the above schema, with the exception of the LogSSISErrors_Package table entries, should be available by querying either catalog views, information schema views or system stored procedures/functions. Most of the table names are self-explanatory and can easily be referred back to the function they play and the relevant metadata information. In the case of LogSSISErrors_Process and LogSSISErrors_Object, both tables' data comes from the same query, with the distinction made on the object type i.e. views and tables go into 'Object', whereas stored procedures are bucketed as processes (even though technically they are objects too) and placed in the LogSSISErrors_Process table. In the case of the LogSSISErrors_Package table, the package metadata is not stored on the server level but can be sourced from a .dtsx file using T-SQL. This part of the solution was made possible thanks to Jamie Thompson's scripts, which you can read about in detail in his post HERE; I simply adopted his technique for my solution.
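As a rough illustration (not the exact query from the script), the split between the two tables boils down to bucketing sys.objects rows by their type code – tables and views on one side, programmable objects on the other:

-- Rough sketch of the Object vs. Process split; the full script runs an equivalent query per database
SELECT  s.name AS SchemaName ,
        o.name AS ObjectName ,
        o.type_desc ,
        CASE WHEN o.type IN ( 'U', 'V' ) THEN 'LogSSISErrors_Object'
             WHEN o.type IN ( 'P', 'PC', 'FN', 'IF', 'TF', 'FS', 'FT', 'TR', 'TA' )
             THEN 'LogSSISErrors_Process'
             ELSE 'Not captured'
        END AS TargetTable
FROM    sys.objects o
        JOIN sys.schemas s ON s.schema_id = o.schema_id
WHERE   o.is_ms_shipped = 0;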

The script used to create all the database objects and populate them with SQL Server instance metadata is close to 1,800 lines long, so in this post I'll only go through sections of it, skipping all the boilerplate SQL; however, a full working version is available for download from my OneDrive folder HERE. The script is divided into a few sections, the main parts being the AdminDBA database metadata collection used for subsequent object recreation, and a second part driven by conditional logic where the count of foreign key constraints and the count of tables from the two table variables created are compared.

The first part creates two table variables storing AdminDBA object metadata and the foreign key constraints referencing individual tables. If those tables have already been created and the relationships exist, then the rest of the script utilizes this metadata in subsequent execution. The following T-SQL generates the table variables and their content.

DECLARE @TempFKs TABLE
    (
      ID INT IDENTITY(1, 1)
             NOT NULL ,
      Foreign_Key_Name VARCHAR(512) ,
      Parent_Object_Name VARCHAR(256) ,
      Referenced_Object_Name VARCHAR(256) ,
      Parent_Column_Name VARCHAR(256) ,
      Referenced_Column_Name VARCHAR(256)
    )				

INSERT  INTO @TempFKs
        ( Foreign_Key_Name ,
          Parent_Object_Name ,
          Referenced_Object_Name ,
          Parent_Column_Name ,
          Referenced_Column_Name
        )
        SELECT  fk.name AS foreign_key_name ,
                po.name AS parent_object_name ,
                ro.name AS referenced_object_name ,
                pc.name AS parent_column_name ,
                rc.name AS referenced_column_name
        FROM    sys.foreign_key_columns fc
                JOIN sys.columns pc ON pc.column_id = parent_column_id
                                       AND parent_object_id = pc.object_id
                JOIN sys.columns rc ON rc.column_id = referenced_column_id
                                       AND referenced_object_id = rc.object_id
                JOIN sys.objects po ON po.object_id = pc.object_id
                JOIN sys.objects ro ON ro.object_id = rc.object_id
                JOIN sys.foreign_keys fk ON fk.object_id = fc.constraint_object_id
        WHERE   ro.type = 'U'
                AND fk.type = 'F'								

DECLARE @TempTbls TABLE
    (
      ID INT IDENTITY(1, 1)
             NOT NULL ,
      Table_Name VARCHAR(128) NOT NULL ,
      Table_Schema_Definition VARCHAR(MAX) NOT NULL
    )				

INSERT  INTO @TempTbls
        ( Table_Name ,
          Table_Schema_Definition
        )
        SELECT  so.name AS Table_Name ,
                'create table [' + so.name + '] (' + STUFF(o.list, LEN(o.list),
                                                           1, '') + ')'
                + CASE WHEN tc.CONSTRAINT_NAME IS NULL THEN ''
                       ELSE 'ALTER TABLE ' + so.name + ' ADD CONSTRAINT '
                            + tc.CONSTRAINT_NAME + ' PRIMARY KEY ' + ' ('
                            + LEFT(j.list, LEN(j.list) - 1) + ')'
                  END AS Table_Schema_Definition
        FROM    sysobjects so
                CROSS APPLY ( SELECT    '  [' + COLUMN_NAME + '] ' + DATA_TYPE
                                        + CASE DATA_TYPE
                                            WHEN 'sql_variant' THEN ''
                                            WHEN 'text' THEN ''
                                            WHEN 'ntext' THEN ''
                                            WHEN 'xml' THEN ''
                                            WHEN 'decimal'
                                            THEN '('
                                                 + CAST(NUMERIC_PRECISION AS VARCHAR)
                                                 + ', '
                                                 + CAST(NUMERIC_SCALE AS VARCHAR)
                                                 + ')'
                                            ELSE COALESCE('('
                                                          + CASE
                                                              WHEN CHARACTER_MAXIMUM_LENGTH = -1
                                                              THEN 'MAX'
                                                              ELSE CAST(CHARACTER_MAXIMUM_LENGTH AS VARCHAR)
                                                            END + ')', '')
                                          END + ' '
                                        + CASE WHEN EXISTS ( SELECT
                                                              id
                                                             FROM
                                                              syscolumns
                                                             WHERE
                                                              OBJECT_NAME(id) = so.name
                                                              AND name = column_name
                                                              AND COLUMNPROPERTY(id,
                                                              name,
                                                              'IsIdentity') = 1 )
                                               THEN 'IDENTITY('
                                                    + CAST(IDENT_SEED(so.name) AS VARCHAR)
                                                    + ','
                                                    + CAST(IDENT_INCR(so.name) AS VARCHAR)
                                                    + ')'
                                               ELSE ''
                                          END + ' '
                                        + ( CASE WHEN IS_NULLABLE = 'No'
                                                 THEN 'NOT '
                                                 ELSE ''
                                            END ) + 'NULL '
                                        + CASE WHEN INFORMATION_SCHEMA.COLUMNS.COLUMN_DEFAULT IS NOT NULL
                                               THEN 'DEFAULT '
                                                    + INFORMATION_SCHEMA.COLUMNS.COLUMN_DEFAULT
                                               ELSE ''
                                          END + ', '
                              FROM      INFORMATION_SCHEMA.COLUMNS
                              WHERE     TABLE_NAME = so.name
                              ORDER BY  ORDINAL_POSITION
                            FOR
                              XML PATH('')
                            ) o ( list )
                LEFT JOIN INFORMATION_SCHEMA.TABLE_CONSTRAINTS tc ON tc.TABLE_NAME = so.name
                                                              AND tc.CONSTRAINT_TYPE = 'PRIMARY KEY'
                CROSS APPLY ( SELECT    '[' + COLUMN_NAME + '], '
                              FROM      INFORMATION_SCHEMA.KEY_COLUMN_USAGE kcu
                              WHERE     kcu.CONSTRAINT_NAME = tc.CONSTRAINT_NAME
                              ORDER BY  ORDINAL_POSITION
                            FOR
                              XML PATH('')
                            ) j ( list )
        WHERE   xtype = 'U'
                AND name NOT IN ( 'dtproperties' )

By design, once the database and its objects have been (re)created, the counts of foreign key constraints and tables held in the two table variables should differ by exactly one. The below piece of SQL compares the count of foreign key constraints with the count of tables, at which point one of two possible scenarios is triggered – if the counts line up, the objects are re-created using the data encapsulated in the table variables, which already store the DDL code for individual tables as well as the foreign key constraints. If, on the other hand, the counts do not line up, a different part of the stored procedure gets executed, this time responsible for recreating the objects from hard-coded DDL statements and foreign key constraints.

IF NOT EXISTS ( SELECT  COUNT(*) - 1 AS ct
                FROM    @TempFKs a
                EXCEPT
                SELECT  COUNT(*)
                FROM    @TempTbls b )
		AND EXISTS ( SELECT TOP 1 1 FROM @TempTbls )
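The body of that IF block is omitted here; conceptually, the metadata-driven branch simply replays the DDL captured in the table variables. A minimal sketch of the idea (assuming the existing tables have already been dropped – the full script handles this, plus the foreign keys held in @TempFKs, in far more detail):

DECLARE @i INT = 1 ,
    @MaxID INT ,
    @DDL NVARCHAR(MAX);
SELECT  @MaxID = MAX(ID)
FROM    @TempTbls;
WHILE @i <= @MaxID
    BEGIN
        SELECT  @DDL = Table_Schema_Definition
        FROM    @TempTbls
        WHERE   ID = @i;
        EXEC sp_executesql @DDL;    -- re-create the table and its primary key
        SET @i = @i + 1;
    END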

Most of the SQL Server database metadata comes from storing the results of the below query, which extracts database, schema and object information for each database and temporarily stores it in a table variable.

SET @SQL = N'';
SELECT
@SQL = @SQL + ' UNION ALL
SELECT
[database]					=	''' + name	+ ''',
[schema]					=	s.name COLLATE Latin1_General_CI_AI,
[schemaid]					=	s.schema_id,
[schemaowner]				=	dp.name COLLATE Latin1_General_CI_AI,
[processobjectname]			=	ob.name COLLATE Latin1_General_CI_AI,
[processobjectid]			=	ob.object_id,
[processobjectschemaid]		=	ob.schema_id,
[ProcessObjectDescription]	=	NULL,
[processobjecttype]			=	CASE
								WHEN ob.type =	''AF''	THEN	''Aggregate Function''
								WHEN ob.type =	''C''	THEN	''Check Constraint''
								WHEN ob.type =	''D''	THEN	''Default Constraint''
								WHEN ob.type =	''F''	THEN	''Foreign Key Constraint''
								WHEN ob.type =	''FN''	THEN	''SQL Scalar Function''
								WHEN ob.type =	''FS''	THEN	''Assembly Scalar Function''
								WHEN ob.type =	''FT''	THEN	''Assembly Table Valued Function''
								WHEN ob.type =	''IF''	THEN	''SQL Inline Table-valued Function''
								WHEN ob.type =	''IT''	THEN	''Internal Table''
								WHEN ob.type =	''P''	THEN	''SQL Stored Procedure''
								WHEN ob.type =	''PC''	THEN	''Assembly Stored Procedure''
								WHEN ob.type =	''PG''	THEN	''Plan Guide''
								WHEN ob.type =	''PK''	THEN	''Primary Key Constraint''
								WHEN ob.type =	''R''	THEN	''Rule''
								WHEN ob.type =	''RF''	THEN	''Replication Filter Procedure''
								WHEN ob.type =	''S''	THEN	''System Base Table''
								WHEN ob.type =	''SN''	THEN	''Synonym''
								WHEN ob.type =	''SO''	THEN	''Sequence Object''
								WHEN ob.type =	''SQ''	THEN	''Service Queue''
								WHEN ob.type =	''TA''	THEN	''Assembly DML Trigger''
								WHEN ob.type =	''TF''	THEN	''SQL Table Table Valued Function''
								WHEN ob.type =	''TR''	THEN	''SQL DML Trigger''
								WHEN ob.type =	''TT''	THEN	''Table Type''
								WHEN ob.type =	''U''	THEN	''User Defined Table''
								WHEN ob.type =	''UQ''	THEN	''Unique Constraint''
								WHEN ob.type =	''V''	THEN	''View''
								WHEN ob.type =	''X''	THEN	''Extended Stored Procedure''
								ELSE							''Unknown Object Type''
								END,
[createdate]				=	CAST(ob.create_date as date),
[modifieddate]				=	CAST(ob.modify_date as date)
FROM ' + QUOTENAME(name) + '.sys.schemas AS s
INNER JOIN ' + QUOTENAME(name)
+ '.sys.database_principals dp
on s.principal_id = dp.principal_id
INNER JOIN ' + QUOTENAME(name) + '.sys.objects AS ob
ON s.[schema_id] = ob.[schema_id]
where s.name <> ''sys'''
FROM    sys.databases
WHERE   database_id > 4
        AND name NOT IN ( 'ReportServer$SQL2014_MULTIDIM',
                          'ReportServer$SQL2014_MULTIDIMTempDB' );
SET @SQL = @SQL + N' ORDER BY [database],[schema];';
SET @SQL = STUFF(@SQL, 1, 11, '');

DECLARE @EnumDBMeta TABLE
    (
      ID INT NOT NULL
             IDENTITY(1, 1) ,
      DBName NVARCHAR(128) ,
      SchemaName NVARCHAR(128) ,
      SchemaID INT ,
      SchemaOwner NVARCHAR(128) ,
      ProcessObjectName NVARCHAR(128) ,
      ProcessObjectID INT ,
      ProcessObjectSchemaID INT ,
      ProcessObjectDescription NVARCHAR(2048) ,
      ProcessObjectType NVARCHAR(128) ,
      ProcessObjectCreatedDate DATE ,
      ProcessObjectModifiedDate DATE
    )
INSERT  INTO @EnumDBMeta
        ( DBName ,
          SchemaName ,
          SchemaID ,
          SchemaOwner ,
          ProcessObjectName ,
          ProcessObjectID ,
          ProcessObjectSchemaID ,
          ProcessObjectDescription ,
          ProcessObjectType ,
          ProcessObjectCreatedDate ,
          ProcessObjectModifiedDate
        )
EXEC sp_executesql @SQL

Instance-level metadata comes mainly from passing different arguments to the SERVERPROPERTY function to return instance-specific information. Database user metadata comes from sys.database_principals, whereas login details are returned from the sys.server_principals catalog view.
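For reference, the kind of queries involved look roughly like the ones below (the exact property list and filters used in the full script may differ slightly):

-- Instance-level metadata
SELECT  SERVERPROPERTY('MachineName') AS MachineName ,
        SERVERPROPERTY('ServerName') AS ServerName ,
        SERVERPROPERTY('Edition') AS Edition ,
        SERVERPROPERTY('ProductVersion') AS ProductVersion ,
        SERVERPROPERTY('ProductLevel') AS ProductLevel ,
        SERVERPROPERTY('Collation') AS Collation ,
        SERVERPROPERTY('IsClustered') AS IsClustered;

-- Database users (run in the context of each database)
SELECT  name ,
        type_desc ,
        create_date
FROM    sys.database_principals
WHERE   type IN ( 'S', 'U', 'G' );

-- Server logins
SELECT  name ,
        type_desc ,
        create_date
FROM    sys.server_principals
WHERE   type IN ( 'S', 'U', 'G' );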

SSIS package metadata is a little more complex to interrogate. In the case of the LogSSISErrors_Package table, the package metadata is not stored on the server level but can be sourced from a .dtsx file using T-SQL. As mentioned before, this part of the solution was made possible thanks to Jamie Thompson's scripts, which you can read about in detail in his post HERE. This part of the script accesses the file system and reads the .dtsx XML content, storing relevant attributes in the LogSSISErrors_Package table.

EXEC sp_configure 'show advanced options', 1;
RECONFIGURE;
EXEC sp_configure 'xp_cmdshell', 1;
RECONFIGURE;

DECLARE @Path VARCHAR(2000);
SET @Path = 'C:\*.dtsx';
		--Must be of form [drive letter]\...\*.dtsx

DECLARE @MyFiles TABLE
    (
      MyID INT IDENTITY(1, 1)
               PRIMARY KEY ,
      FullPath VARCHAR(2000)
    );
DECLARE @CommandLine VARCHAR(4000);

SELECT  @CommandLine = LEFT('dir "' + @Path + '" /A-D /B /S ', 4000);
INSERT  INTO @MyFiles
        ( FullPath )
        EXECUTE xp_cmdshell @CommandLine;
DELETE  FROM @MyFiles
WHERE   FullPath IS NULL
        OR FullPath = 'File Not Found'
        OR FullPath = 'The system cannot find the path specified.'
        OR FullPath = 'The system cannot find the file specified.'; 

IF EXISTS ( SELECT  *
            FROM    sys.tables
            WHERE   name = N'pkgStats' )
    DROP	TABLE pkgStats;
CREATE	 TABLE pkgStats
    (
      PackagePath VARCHAR(900) NOT NULL
                               PRIMARY KEY ,
      PackageXML XML NOT NULL
    );

DECLARE @FullPath VARCHAR(2000);
DECLARE file_cursor CURSOR
FOR
    SELECT  FullPath
    FROM    @MyFiles;
OPEN	file_cursor
FETCH NEXT FROM file_cursor INTO @FullPath;
WHILE @@FETCH_STATUS = 0
    BEGIN
        SET @SQL = '
					INSERT	pkgStats (PackagePath,PackageXML)
		select  ''@FullPath'' as PackagePath
		,		cast(BulkColumn as XML) as PackageXML
		from    openrowset(bulk ''@FullPath'',
								single_blob) as pkgColumn';
        SELECT  @SQL = REPLACE(@SQL, '@FullPath', @FullPath);
        EXEC sp_executesql @SQL;

        FETCH NEXT FROM file_cursor INTO @FullPath;
    END
CLOSE	file_cursor;
DEALLOCATE file_cursor;

DECLARE @pkgStatsBase TABLE
    (
      PackagePath VARCHAR(900) ,
      PackageId UNIQUEIDENTIFIER ,
      PackageCreatorsName NVARCHAR(500) ,
      PackageFormatVersion SMALLINT ,
      PackageType NVARCHAR(50) ,
      PackageDescription NVARCHAR(2000) ,
      PackageVersionMajor SMALLINT ,
      PackageVersionMinor SMALLINT ,
      PackageVersionBuild SMALLINT ,
      PackageVersionGUID UNIQUEIDENTIFIER ,
      PackageXML XML
    );

INSERT  INTO @pkgStatsBase
        SELECT  PackagePath ,
                CAST(PackageXML.value('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
							/DTS:Executable[1]/DTS:Property[@DTS:Name=''DTSID''][1]',
                                      'nvarchar(500)') AS UNIQUEIDENTIFIER) AS PackageID ,
                PackageXML.value('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
							/DTS:Executable[1]/DTS:Property[@DTS:Name=''CreatorName''][1]',
                                 'nvarchar(500)') AS PackageCreatorsName ,
                CAST(PackageXML.value('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
							/DTS:Executable[1]/DTS:Property[@DTS:Name=''PackageFormatVersion''][1]',
                                      'varchar(3)') AS SMALLINT) AS PackageFormatVersion ,
                CAST(PackageXML.value('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
							DTS:Executable[1]/@DTS:ExecutableType[1]',
                                      'nvarchar(50)') AS NVARCHAR(50)) AS PackageType ,
                PackageXML.value('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
							/DTS:Executable[1]/DTS:Property[@DTS:Name=''Description''][1]',
                                 'nvarchar(2000)') AS PackageDescription ,
                CAST(PackageXML.value('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
							/DTS:Executable[1]/DTS:Property[@DTS:Name=''VersionMajor''][1]',
                                      'varchar(3)') AS SMALLINT) AS PackageVersionMajor ,
                CAST(PackageXML.value('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
							/DTS:Executable[1]/DTS:Property[@DTS:Name=''VersionMinor''][1]',
                                      'varchar(3)') AS SMALLINT) AS PackageVersionMinor ,
                CAST(PackageXML.value('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
							/DTS:Executable[1]/DTS:Property[@DTS:Name=''VersionBuild''][1]',
                                      'varchar(3)') AS SMALLINT) AS PackageVersionBuild ,
                CAST(PackageXML.value('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
							/DTS:Executable[1]/DTS:Property[@DTS:Name=''VersionGUID''][1]',
                                      'char(38)') AS UNIQUEIDENTIFIER) AS PackageVersionGUID ,
                PackageXML
        FROM    pkgStats

DECLARE @AllpkgStats TABLE
    (
      PackageId UNIQUEIDENTIFIER NULL ,
      PackagePath VARCHAR(900) PRIMARY KEY
                               NOT NULL ,
      PackageName VARCHAR(900) NULL ,
      PackageCreatorsName NVARCHAR(500) NULL ,
      PackageFormatVersion SMALLINT NULL ,
      PackageType NVARCHAR(50) NULL ,
      PackageDescription NVARCHAR(2000) NULL ,
      PackageVersionMajor SMALLINT NULL ,
      PackageVersionMinor SMALLINT NULL ,
      PackageVersionBuild SMALLINT NULL ,
      PackageVersionGUID UNIQUEIDENTIFIER NULL ,
      PackageTasksNumber INT NULL ,
      PackageContainersNumber INT NULL ,
      PackageDataFlowsNumber INT NULL ,
      PackageConnectionManagersNumber INT NULL ,
      PackageVariablesInEntirePackageNumber INT NULL ,
      PackageXML XML NULL
    ); 

INSERT  INTO @AllpkgStats
        ( PackageId ,
          PackagePath ,
          PackageName ,
          PackageCreatorsName ,
          PackageFormatVersion ,
          PackageType ,
          PackageDescription ,
          PackageVersionMajor ,
          PackageVersionMinor ,
          PackageVersionBuild ,
          PackageVersionGUID ,
          PackageXML
        )
        SELECT  p.PackageId ,
                p.PackagePath ,
                SUBSTRING(PackagePath,
                          LEN(PackagePath) - CHARINDEX('\',
                                                       REVERSE(PackagePath), 0)
                          + 2, LEN(PackagePath)) AS PackageName ,
                p.PackageCreatorsName ,
                p.PackageFormatVersion ,
                p.PackageType ,
                p.PackageDescription ,
                p.PackageVersionMajor ,
                p.PackageVersionMinor ,
                p.PackageVersionBuild ,
                p.PackageVersionGUID ,
                p.PackageXML
        FROM    @pkgStatsBase p;
		--Number of tasks
MERGE INTO @AllpkgStats AS t
USING
    ( SELECT    PackagePath ,
                COUNT(*) AS PackageTasksNumber
      FROM      @pkgStatsBase p
                CROSS    APPLY p.PackageXML.nodes('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
									//DTS:Executable[@DTS:ExecutableType!=''STOCK:SEQUENCE''
								and    @DTS:ExecutableType!=''STOCK:FORLOOP''
								and    @DTS:ExecutableType!=''STOCK:FOREACHLOOP''
								and not(contains(@DTS:ExecutableType,''.Package.''))]') Pkg ( props )
      GROUP BY  PackagePath
    ) s
ON ( t.PackagePath = s.PackagePath )
WHEN MATCHED THEN
    UPDATE SET PackageTasksNumber = s.PackageTasksNumber;

		--Number of containers
MERGE INTO @AllpkgStats AS t
USING
    ( SELECT    PackagePath ,
                COUNT(*) AS PackageContainersNumber
      FROM      @pkgStatsBase p
                CROSS    APPLY p.PackageXML.nodes('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
									//DTS:Executable[@DTS:ExecutableType=''STOCK:SEQUENCE''
									or    @DTS:ExecutableType=''STOCK:FORLOOP''
									or    @DTS:ExecutableType=''STOCK:FOREACHLOOP'']') Pkg ( props )
      GROUP BY  PackagePath
    ) s
ON ( t.PackagePath = s.PackagePath )
WHEN MATCHED THEN
    UPDATE SET PackageContainersNumber = s.PackageContainersNumber
WHEN NOT MATCHED BY SOURCE THEN
    UPDATE SET PackageContainersNumber = 0;

		--Number of data flows
MERGE INTO @AllpkgStats AS t
USING
    ( SELECT    PackagePath ,
                COUNT(*) AS PackageDataFlowsNumber
      FROM      @pkgStatsBase p
                CROSS    APPLY p.PackageXML.nodes('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
									//DTS:Executable[contains(@DTS:ExecutableType,''.Pipeline.'')]') Pkg ( props )
      GROUP BY  PackagePath
    ) s
ON ( t.PackagePath = s.PackagePath )
WHEN MATCHED THEN
    UPDATE SET PackageDataFlowsNumber = s.PackageDataFlowsNumber
WHEN NOT MATCHED BY SOURCE THEN
    UPDATE SET PackageDataFlowsNumber = 0;

		--Number of connection managers
MERGE INTO @AllpkgStats AS t
USING
    ( SELECT    PackagePath ,
                COUNT(*) AS PackageConnectionManagersNumber
      FROM      @pkgStatsBase p
                CROSS    APPLY p.PackageXML.nodes('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
									//DTS:ConnectionManager') Pkg ( props )
      GROUP BY  PackagePath
    ) s
ON ( t.PackagePath = s.PackagePath )
WHEN MATCHED THEN
    UPDATE SET PackageConnectionManagersNumber = s.PackageConnectionManagersNumber
WHEN NOT MATCHED BY SOURCE THEN
    UPDATE SET PackageConnectionManagersNumber = 0;

		--Number of variables in entire package
MERGE INTO @AllpkgStats AS t
USING
    ( SELECT    PackagePath ,
                COUNT(*) AS PackageVariablesInEntirePackageNumber
      FROM      @pkgStatsBase p
                CROSS    APPLY p.PackageXML.nodes('declare namespace DTS="www.microsoft.com/SqlServer/Dts";
									//DTS:Variable') Pkg ( props )
      GROUP BY  PackagePath
    ) s
ON ( t.PackagePath = s.PackagePath )
WHEN MATCHED THEN
    UPDATE SET PackageVariablesInEntirePackageNumber = s.PackageVariablesInEntirePackageNumber
WHEN NOT MATCHED BY SOURCE THEN
    UPDATE SET PackageVariablesInEntirePackageNumber = 0;

INSERT  INTO AdminDBA.dbo.LogSSISErrors_Package
        ( PackageID ,
          PackageName ,
          PackagePath ,
          PackageCreatorsName ,
          PackageFormatVersion ,
          PackageType ,
          PackageDescription ,
          PackageVersionMajor ,
          PackageVersionMinor ,
          PackageVersionBuild ,
          PackageVersionGUID ,
          PackageTasksNumber ,
          PackageXML ,
          PackageContainersNumber ,
          PackageDataFlowsNumber ,
          PackageConnectionManagersNumber ,
          PackageVariablesInEntirePackageNumber ,
          CurrentlyUsed
        )
        SELECT  PackageId ,
                PackageName ,
                PackagePath ,
                PackageCreatorsName ,
                PackageFormatVersion ,
                PackageType ,
                PackageDescription ,
                PackageVersionMajor ,
                PackageVersionMinor ,
                PackageVersionBuild ,
                PackageVersionGUID ,
                PackageTasksNumber ,
                PackageXML ,
                PackageContainersNumber ,
                PackageDataFlowsNumber ,
                PackageConnectionManagersNumber ,
                PackageVariablesInEntirePackageNumber ,
                1 AS CurrentlyUsed
        FROM    @AllpkgStats

IF EXISTS ( SELECT  *
            FROM    sys.tables
            WHERE   name = N'pkgStats' )
    DROP	TABLE pkgStats;

EXEC sp_configure 'xp_cmdshell', 0;
RECONFIGURE;
EXEC sp_configure 'show advanced options', 0;
RECONFIGURE;

Upon completion of execution, this code should create a small set of tables prefixed with LogSSISErrors, with metadata entries in some or all of them depending on your local SQL Server instance, as per the image below.

Error_Capture_DB_AdminDBA_Objects_OnStart
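A quick way to confirm the outcome is to list the freshly created tables and their row counts; a simple check along these lines (not part of the original script) will do:

SELECT  t.name AS TableName ,
        SUM(p.rows) AS RowsLoaded
FROM    AdminDBA.sys.tables t
        JOIN AdminDBA.sys.partitions p ON p.object_id = t.object_id
                                          AND p.index_id IN ( 0, 1 )
WHERE   t.name LIKE 'LogSSISErrors%'
GROUP BY t.name
ORDER BY t.name;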

In the NEXT POST I will go through the process of creating a sample test environment, an SSIS package running an error-generating process and another variation of the above stored procedure used to update the database as environment changes occur. Also, all the code used in this solution can be downloaded from my OneDrive folder HERE.


Microsoft SQL Server Hardware Evaluation and Sizing – Best Practices Guide (Part 2)

October 2nd, 2014 / 1 Comment » / by admin

Continuing on from the FIRST post in this series, this installment focuses on arguably the most important aspects of SQL Server hardware sizing – CPU, RAM and storage selection. Let's start with processor selection.

CPU

The central processing unit is the heart of every system and that is particularly evident when dealing with database workloads, as CPU selection drives other component choices e.g. how many CPU sockets you can have in the server, physical memory capacity, I/O capacity, upgrade path etc. SQL Server can work with both Intel and AMD processors; however, there are some distinct differences between the two options. Intel has been dominating the single-threaded performance race for a long time now, whereas most modern AMD CPUs have a high physical core count which sometimes makes them more suitable, compared to their Intel counterparts, for certain types of deployments e.g. SQL Server Business Intelligence edition (licensed per server + CALs rather than per core). Starting with SQL Server 2012, Microsoft introduced the SQL Server Core Factor table which allows for up to a 25 percent discount when licensing a server with AMD CPUs; however, AMD CPUs are still difficult to recommend given their lower overall single-threaded performance and increased licensing cost. As a result, I will only be discussing Intel CPU selection for SQL Server deployments.
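Whichever CPU you settle on, it is worth confirming what the operating system actually presents to SQL Server, since that is what per-core licensing counts against. A minimal example using a standard DMV (available since SQL Server 2005):

-- Logical CPU layout as seen by SQL Server
SELECT  cpu_count AS LogicalCPUs ,
        hyperthread_ratio AS HyperthreadRatio ,
        cpu_count / hyperthread_ratio AS PhysicalCPUSockets
FROM    sys.dm_os_sys_info;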

Based on the current range of CPUs from Intel (Q1, 2015), the following processor models constitute my recommended selection for both OLTP and DW/DSS workloads, grouped by socket count.

SQL_Hardware_Eval_2_sockets_OLTP

SQL_Hardware_Eval_2_sockets_DW

SQL_Hardware_Eval_4_sockets_OLTP

SQL_Hardware_Eval_4_sockets_DW

SQL_Hardware_Eval_8_sockets_OLTP

SQL_Hardware_Eval_8_sockets_DW

As new and improved processors are introduced the above list will gradually become less applicable, so make sure you check the Intel website for new or upcoming releases. Also, rather than just taking a blind shot at a CPU model, it is always good practice to analyse the CPU's performance using benchmarking tools, or at least to look at the scores obtained by systems utilizing those chips if available online. The two most commonly used benchmarks are TPC-E (OLTP focused) and TPC-H (Decision Support focused).

TPC Benchmark™ E (TPC-E) is an On-Line Transaction Processing (OLTP) workload developed by the TPC. The TPC-E benchmark uses a database to model a brokerage firm with customers who generate transactions related to trades, account inquiries, and market research. The brokerage firm in turn interacts with financial markets to execute orders on behalf of the customers and updates relevant account information. The benchmark is “scalable,” meaning that the number of customers defined for the brokerage firm can be varied to represent the workloads of different-size businesses. The benchmark defines the required mix of transactions the benchmark must maintain. The TPC-E metric is given in transactions per second (tps). It specifically refers to the number of Trade-Result transactions the server can sustain over a period of time. Although the underlying business model of TPC-E is a brokerage firm, the database schema, data population, transactions, and implementation rules have been designed to be broadly representative of modern OLTP systems.

Current (18 March, 2015) top TPC-E performers are as per the screenshot below.

SQL_Hardware_Eval_TPCE_Top_Perfomers

The TPC Benchmark™H (TPC-H) is a decision support benchmark. It consists of a suite of business oriented ad-hoc queries and concurrent data modifications. The queries and the data populating the database have been chosen to have broad industry-wide relevance. This benchmark illustrates decision support systems that examine large volumes of data, execute queries with a high degree of complexity, and give answers to critical business questions. The performance metric reported by TPC-H is called the TPC-H Composite Query-per-Hour Performance Metric (QphH@Size), and reflects multiple aspects of the capability of the system to process queries. These aspects include the selected database size against which the queries are executed, the query processing power when queries are submitted by a single stream, and the query throughput when queries are submitted by multiple concurrent users. The TPC-H Price/Performance metric is expressed as $/QphH@Size.

Current (18 March, 2015) top TPC-H performers are as per the screenshot below.

SQL_Hardware_Eval_TPCH_Top_Perfomers

Storage

Properly configured storage, another key component of deploying a robust SQL Server infrastructure, can have a drastic impact on how your database performs. There is a multitude of factors to consider e.g. budget, server usage, your existing hardware, performance expectations etc., but the first one to consider should be the expected workload type. As discussed in PART 1 of this series, SQL Server can perform a few different types of workloads e.g. OLTP, DW/DSS, OLAP or a mixture of those. These workloads have different I/O access patterns and the storage choices should reflect them. Here are the most common storage access characteristics for each workload type.

  • OLTP workloads – frequent writes to data files and log files, as well as frequent random reads from data files if the database does not fit into memory.
  • DW/DSS workloads – large sequential reads from data files.
  • OLAP workloads – frequent, random reads from cubes.

SQL Server can typically be deployed onto five different types of storage which, as was the case with the workload types, will affect performance levels. These are:

  • Internal drives – your traditional magnetic drives or more modern solid state drives (SSDs)
  • PCI-E storage cards
  • Storage Area Network (SAN)
  • Direct-attached storage
  • Server message block (SMB) file shares (Windows Server 2012 and higher)

Nevertheless, storage type is not the only cause of poor storage-related performance. Issues can also come from the RAID level the disks have been provisioned with, the number of disks, disk speed e.g. 10K vs. 15K drives, the bandwidth of the RAID controller and many more, so it's important to look not only at the storage medium type but also at the extra features it carries and how it has been provisioned/configured. The following '10 commandments' typically apply to a holistic storage configuration for SQL Server.

SQL_Hardware_Eval_Storage_Config_Recom

Even though the above applies to storage configuration in general, regardless of whether the workload targets OLTP or DW/DSS environments, data warehouse systems rely heavily on storage access speed as the copious volumes of data cannot be cached in memory. Selecting the fastest storage medium and a configuration with the best sequential read speed e.g. PCI-E cards in a RAID1 array will provide a significant performance boost. If the cost of PCI-E card(s) is considered prohibitive, solid state drives can deliver much better performance than magnetic disk drives on a budget, whilst also offering simplified management, lower operational costs and predictable maintenance. The following advantages typically characterize SSD storage and its choice over magnetic disk drives.

  • Simplified management – Solid state storage does not require fragmentation management. The SQL Server startup option -E should still be used, but no further optimization or management of page allocation is required. This simplification makes long-term management of FTDW environments significantly easier. In addition, larger disk groups and lower volume/LUN counts can be used with no negative performance implications. This change simplifies filegroup creation and maintenance.
  • I/O resiliency – Solid state storage has minimal performance degradation under high concurrency or page fragmentation. In addition, mixed random read (seek) workload does not negatively impact large request (scan) I/O patterns.
  • Predictable maintenance – Many solid-state storage options provide software based write-life monitoring with lower frequency of difficult-to-predict physical failures.
  • Lower operational cost – While more expensive at list price, solid-state storage offers a more efficient balance of I/O throughput to capacity per unit. Effective FTDW workload I/O rates for a 300 GB 10k SAS HDD average 50 MB/s. Enterprise MLC SSD delivers between 150 and 200 MB/s at 600-GB capacity. Additionally, solid-state storage draws significantly less power, generates less heat, and often supports higher density solutions.

In addition to the storage configuration guidelines outlined above, the following adjustments can be employed to maximise DW/DSS performance when coupled with solid state drives for primary data storage.

  • If mirroring is required, RAID1+0 or RAID5 can be used. RAID5 provides the best capacity at no performance penalty for FTDW workloads on solid state.
  • LUN and volume count can be reduced to as low as one PRI volume per storage unit. It is helpful in some cases to have the PRI volume count be a multiple of the CPU core count. The minimum PRI volume count is two.
  • The transaction log can be placed on solid state as well, but FTDW workloads are not typically log-bound. Cost can be reduced by placing the log on traditional HDD. The same is true of local storage for Windows Server and SQL Server installation.
  • Recommendations for page fragmentation management and clustered index parallel loading can be ignored, because logical database fragmentation does not impact solid state I/O performance.

In order to evaluate existing storage performance, or to approximate the requirements for a storage system that would resolve the most common storage-related issues i.e. low IOPS and throughput and high latency, there are a number of tools that can facilitate this analysis. The most common ones are SQLIO from Microsoft and CrystalDiskMark, both free. The latter can be downloaded as a portable edition (no installation required) from HERE and the rest is straightforward as the interface is simple and intuitive. CrystalDiskMark throws sequential and random, read and write loads at your storage system in 512KB and 4KB chunks. This gets you quick answers, but not necessarily SQL Server-related ones – after all, SQL Server doesn't read or write in 512KB or 4KB chunks. Nevertheless, this tool is a quick and straightforward way to get a glimpse into your storage throughput capabilities, as per the images below (samples from my Samsung M.2 SATA/600 SSD and Patriot Rage XT USB 3.0 flash drive for comparison).

SQL_Hardware_Eval_CrystalDiskMark_Sample_Perf_Results

SQLIO, on the other hand, is a little more comprehensive; even though it has nothing to do with SQL Server (despite what the name may imply), it supports a large number of options and provides a more complete view of the storage subsystem. Brent Ozar's blog and THIS POST have a good overview of getting started with SQLIO so I will not go into details here, but even though the tool has no GUI (most of the testing is done on the command line), don't let that scare you off – with a little bit of notepad magic and Google the implementation is very straightforward. Below is the output of my laptop's storage (single Samsung M.2 SSD in a Lenovo T440s) performance test using SQLIO, with the core performance measures highlighted.

SQL_Hardware_Eval_SQLIO_Sample_Output
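Alongside these external tools, SQL Server itself keeps cumulative I/O statistics per database file, which is a handy sanity check for latency on an instance that is already running a workload. A minimal example:

-- Average read/write latency per database file since the last restart
SELECT  DB_NAME(vfs.database_id) AS DatabaseName ,
        mf.physical_name ,
        vfs.num_of_reads ,
        vfs.num_of_writes ,
        CASE WHEN vfs.num_of_reads = 0 THEN 0
             ELSE vfs.io_stall_read_ms / vfs.num_of_reads
        END AS AvgReadLatencyMs ,
        CASE WHEN vfs.num_of_writes = 0 THEN 0
             ELSE vfs.io_stall_write_ms / vfs.num_of_writes
        END AS AvgWriteLatencyMs
FROM    sys.dm_io_virtual_file_stats(NULL, NULL) vfs
        JOIN sys.master_files mf ON mf.database_id = vfs.database_id
                                    AND mf.file_id = vfs.file_id
ORDER BY AvgReadLatencyMs DESC;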

Memory

SQL Server memory upgrade/selection is probably the cheapest way to boost performance e.g. at the time of writing this post Amazon had 128GB of DDR3 memory available for around 1,500 dollars, which compared to the price of a SQL Server licence or changing application code seems like a rounding error. For DW/DSS systems, in their Fast Track Data Warehouse Reference Guide (SQL Server 2012), Microsoft recommends a minimum of 64GB for a single-socket server, growing linearly with the socket count available in the system e.g. two-socket server – 128GB, four-socket server – 256GB etc. For the maximum memory recommendation, doubling the minimum is a good estimate, as per the table below.

SQL_Hardware_Eval_MS_Memory_Recom

The following considerations are also important to bear in mind when evaluating system memory requirements:

  • Query from cache – Workloads that service a large percentage of queries from cache may see an overall benefit from increased RAM allocations as the workload grows.
  • Hash joins and sorts – Queries that rely on large-scale hash joins or perform large-scale sorting operations will benefit from large amounts of physical memory. With smaller memory, these operations spill to disk and heavily utilise tempdb, which introduces a random I/O pattern across the data drives on the server.
  • Loads – Bulk inserts can also introduce sorting operations that utilise tempdb if they cannot be processed in available memory.
  • xVelocity memory-optimized columnstore index – Workloads that heavily favor columnstore index query plans run more efficiently with memory pools at the higher end of the ranges listed in the table above.

Naturally these recommendations will need to be reconciled against the SQL Server edition you plan on running, as different versions have different limitations on the amount of memory used by the database engine or other SQL Server components (see PART 1 for specifics) e.g. SQL Server Standard Edition (2014 version) allows up to 128GB for the database engine and an additional 64GB for SSAS and SSRS.

When it comes to OLTP workloads, the general rule of thumb is that as long as the database (including indexes) fits into memory, performance should be adequate; however, when sizing memory for a new server more is always better. If using Standard or BI Edition (SQL Server 2014), allocating as much memory as the licence permits i.e. 128GB for the database engine is good practice. When running SSAS and/or SSRS on the same server, Microsoft allows an additional 64GB for Standard Edition and as much as the OS can take for the BI equivalent. Things get a little tricky when dealing with Enterprise Edition where, in theory, you can allocate as much as 4TB of RAM to a single instance if running on Windows Server 2012 or higher. If unsure what the adequate size should be, buy as much as possible; otherwise, running a number of tests and evaluating system performance in view of memory usage should shed some light on the actual requirements and on the point at which your memory comes under strain.
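Whatever amount you end up with, remember to cap SQL Server's share of it so the operating system (and anything else on the box) keeps some headroom. The figure below is just an illustrative example for a 128GB server, not a universal recommendation:

EXEC sp_configure 'show advanced options', 1;
RECONFIGURE;
-- Example only: leave roughly 16GB for the OS and other services on a 128GB server
EXEC sp_configure 'max server memory (MB)', 112640;
RECONFIGURE;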

One of the best places to start capacity planning by measuring performance statistics is Performance Monitor, otherwise known as Perfmon. Perfmon allows you to measure various performance statistics at a regular interval and save the results to a file for further analysis. The database administrator picks the time interval, the file format, and which statistics are monitored. SQL Server provides numerous counters for monitoring and evaluating the performance of your system. Although there are no hard and fast rules for performance monitoring, you should audit your system periodically to ensure that it lives up to user expectations. The first step in any performance monitoring effort should be collecting baseline information: a baseline is a collection of performance counter values captured while the system isn't heavily utilised and performs well. Next, you should take benchmarks at various times of day and on different days of the week to figure out acceptable ranges for the performance counters you intend to monitor. For example, your tests could reveal that buffer cache hit ratio values of 93.5% and higher are acceptable, but any value below 93.5% indicates sub-optimal performance. Then set up continuous monitoring and define alerts which notify you if any performance counter value falls below (or rises above) a predefined threshold. Although we try to provide some guidelines for performance counter values, the adequate range for each counter will vary from one environment to the next; this is why it's important to create a baseline and determine acceptable values for each counter. You should watch several memory-related counters very closely as they present a clear indication of acceptable or sub-optimal performance.

A large majority of these memory performance counters are available through the Memory Manager, Plan Cache and Buffer Manager objects and their associated counters, as per the table below.

SQL_Hardware_Eval_Memory_Counters
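Most of these counters can also be read straight from T-SQL via the sys.dm_os_performance_counters DMV, which makes it easy to script a lightweight baseline collector; for example:

-- Key memory counters; note that 'Buffer cache hit ratio' must be divided by its 'base' counter
SELECT  RTRIM(object_name) AS CounterObject ,
        RTRIM(counter_name) AS CounterName ,
        cntr_value
FROM    sys.dm_os_performance_counters
WHERE   counter_name IN ( 'Page life expectancy', 'Buffer cache hit ratio',
                          'Buffer cache hit ratio base', 'Total Server Memory (KB)',
                          'Target Server Memory (KB)' );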

Quest Software also released a handy poster which provides descriptions of key Windows/SQL Perfmon counters along with their recommended target values (you can download it from HERE).

Another factor to consider is the effect of different memory configurations on memory bandwidth. As you add more DIMMs to a system to populate more of the available memory slots, the bandwidth of the memory can decrease, which can hurt memory access performance somewhat. This effect varies based on the processor and the associated integrated memory controller being used; the latest Intel processors are much less susceptible to this problem. Regardless, I still favor using large amounts of RAM, because even 'slower' RAM is an order of magnitude faster than your storage subsystem. If you are concerned about this, you should consult the documentation for your server for recommendations on how to configure your memory.

Conclusion

This concludes the second post in this series. The information outlined in these two installments provides a rudimentary 'check list' of the core functions that typically require investigating when deciding on provisioning/procuring new hardware for a SQL Server installation; usually a much more comprehensive approach needs to be taken when evaluating future needs. When faced with an infrastructure upgrade, an experienced DBA will carefully establish performance baselines and determine the production capacity needed by an organisation/application to meet current and future demands. Some tools which may assist in capacity planning have already been listed above and typically provide a single point of focus on various pressure points when dealing with different realms of hardware upgrade/update. Sometimes it may be beneficial to use more comprehensive tooling which gives the administrator a more holistic view of the system's performance e.g. SQL Sentry Performance Advisor or Database Performance Analyzer. A full-featured trial version of any of these applications can usually be downloaded, installed and evaluated for a period of time, so having the ability to view, troubleshoot and report on the performance of the system's core components can be a great asset and speed up the analysis phase. Below is a screenshot of the SQL Sentry Performance Advisor activity dashboard for one of my SQL Server instances (click on image to expand).

SQL_Hardware_Eval_SQLSentry_Dashboard

Regardless of which tool you use to ascertain your performance needs, every SQL Server hardware selection decision, particularly when deploying a new server, should be preceded by a detailed analysis of all the core components making up the platform, not only from the hardware point of view but also factoring in the licensing model and projected workload type. There is a lot to take into consideration, but a little bit of elbow grease, backed up by some investigative work and consideration of best-practice guidelines, should yield a great-performing system with no unexpected or costly surprises.
