Monday, June 25, 2012

SSIS Step by Step 003 - Load multiple flat files in a loop container


Requirement & Preface


In last post (SSIS Step by Step 002 - Output data to multiple flat files) , we output data from database to multiple flat files.

In this post, we're going to load data from multiple flat files to database table back. 

We are going to learn:
1. How to use for each loop container in SSIS package.
2.  How to use Foreach File Enumerator
 
Testing files are generated by (SSIS Step by Step 002 - Output data to multiple flat files), about the address table schema you can find them in SSIS Step by Step 001 - Load data from a flat file

Here's a folder which contains many flat files, each file contains address information and all address records in a single file has the same city.

Source folder:


Destination table: Address

SSIS Package Framework Screenshot 



Steps:
1. Create a new SSIS package with a name - CH03_LoadBatchFile.dtsx
  
2. Create a package variable - FileName, it will be used to save the single file name in a loop.
3. Create a 'Execute SQL Task' with name - 'EST_TruncateCities', edit it .
    Configure the connection and SQL Statement.
    This task is to truncate address table before loading data. 

    
4. Add a 'Foreach Loop Container' and edit it.
    In Collection Pane - 
    Enumerator - Foreach File Enumerator
    Folder - Where the flat file locate
    Files - CHO2-Citys-*.txt         * means any characters. 


   In Variable Mappings Pane - Select package variable 'FileName' to map the value in loop. 



5. Drag a data flow task, and add below two components in data flow.
    Flat File Source - FF_SRC_Address
    OLE DB Destination
6. The source is from flat files, so edit 'FF_SRC_Address'.
     
    Browse and choose a source flat file, because the files which match the 'CHO2-Citys-*.txt' have the same file format, so we can choose any matched file first to create mapping columns in this editor.



Configure the properties of each column - please refer to Step by Step 001 - Load data from a flat file



7. Edit 'OLE DB Destination'



 8. Now we need to configure the flat file connection manager, the current flat file connection is tied to a specified file. We need to configure it with a new value which is from loop container - User::FlatFile in step 4.



Click 'CM_FF_AddressByCity' connection manager, and edit its expression.

Map connection string property  to package variable 'FileName'



9. Save package and run it.







Get all SSIS Step by Step, refer to  SIMON'S SSIS Step by Step

Please let me know if you have any questions about this blog.
Contact to me by email - simonlvpin@yahoo.com
Or Skype - simonlvpin

Wednesday, June 20, 2012

SQL - XML Auto


USE AdventureWorks2008R2
GO

-- Auto with ELEMENTS
SELECT p.BusinessEntityID,
       p.FirstName,
       p.MiddleName,
       p.LastName
FROM Person.Person AS p -- The element name will be same with 'p'
FOR XML AUTO,
ROOT('Persons'),
ELEMENTS XSINIL

/** -- Output --
<Persons xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
   <p>
    <BusinessEntityID>285</BusinessEntityID>
    <FirstName>Syed</FirstName>
    <MiddleName>E</MiddleName>
    <LastName>Abbas</LastName>
  </p>
  <p>
    <BusinessEntityID>293</BusinessEntityID>
    <FirstName>Catherine</FirstName>
    <MiddleName>R.</MiddleName>
    <LastName>Abel</LastName>
  </p>
  <p>
    <BusinessEntityID>295</BusinessEntityID>
    <FirstName>Kim</FirstName>
    <MiddleName xsi:nil="true" />
    <LastName>Abercrombie</LastName>
  </p>
</Persons>
**/

-- Auto ELEMENTS
SELECT person.BusinessEntityID AS ID,
       person.FirstName        AS firstName,
       person.MiddleName       AS middleName,
       person.LastName         AS lastName
FROM Person.Person             AS person  -- Row Name
FOR XML AUTO,
ROOT('Persons'),
ELEMENTS XSINIL

/** -- Output -- Notice the output element name
<Persons xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <person>
    <ID>285</ID>
    <firstName>Syed</firstName>
    <middleName>E</middleName>
    <lastName>Abbas</lastName>
  </person>
  <person>
    <ID>293</ID>
    <firstName>Catherine</firstName>
    <middleName>R.</middleName>
    <lastName>Abel</lastName>
  </person>
  <person>
    <ID>295</ID>
    <firstName>Kim</firstName>
    <middleName xsi:nil="true" />
    <lastName>Abercrombie</lastName>
  </person>
</Persons>
**/

SQL - XML RAW


USE AdventureWorks2008R2
GO

-- Use RAW modal to create XML
SELECT p.FirstName AS firstName,
    p.MiddleName AS middleName,
    p.LastName AS lastName
FROM Person.Person AS p
FOR XML RAW

/** -- Output --
<row firstName="Syed" middleName="E" lastName="Abbas" />
<row firstName="Catherine" middleName="R." lastName="Abel" />
<row firstName="Kim" lastName="Abercrombie" />
<row firstName="Kim" lastName="Abercrombie" />
<row firstName="Kim" middleName="B" lastName="Abercrombie" />
<row firstName="Hazem" middleName="E" lastName="Abolrous" />
**/

-- Use RAW modal and set the ROOT for XML elements
SELECT p.FirstName,
    p.MiddleName,
    p.LastName
FROM Person.Person AS p
FOR XML RAW('Person'),
   ROOT('Persons')
/** -- Output --
<Persons>
  <Person FirstName="Syed" MiddleName="E" LastName="Abbas" />
  <Person FirstName="Catherine" MiddleName="R." LastName="Abel" />
  <Person FirstName="Kim" LastName="Abercrombie" />
  <Person FirstName="Kim" LastName="Abercrombie" />
  <Person FirstName="Kim" MiddleName="B" LastName="Abercrombie" />
  <Person FirstName="Hazem" MiddleName="E" LastName="Abolrous" />
  <Person FirstName="Sam" LastName="Abolrous" />
</Persons>
**/

-- Use RAW modal with ELEMENTS and XSINIL
-- Notice the Person who doesn't have the Middle Name 
SELECT p.FirstName,
    p.MiddleName,
    p.LastName
FROM Person.Person AS p
FOR XML RAW('Person'),
   ROOT('Persons'),
   ELEMENTS XSINIL
/** -- Output --
<Persons xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <Person>
    <FirstName>Syed</FirstName>
    <MiddleName>E</MiddleName>
    <LastName>Abbas</LastName>
  </Person>
  <Person>
    <FirstName>Catherine</FirstName>
    <MiddleName>R.</MiddleName>
    <LastName>Abel</LastName>
  </Person>
  <Person>
    <FirstName>Kim</FirstName>
    <MiddleName xsi:nil="true" />
    <LastName>Abercrombie</LastName>
  </Person>
</Persons>
**/

-- Use RAW modal with ELEMENTS 
-- Notice the Person who doesn't have the Middle Name 
SELECT p.FirstName,
    p.MiddleName,
    p.LastName
FROM Person.Person AS p
FOR XML RAW('Person'),
   ROOT('Persons'),
   ELEMENTS  
/** -- Output --
<Persons>
  <Person>
    <FirstName>Syed</FirstName>
    <MiddleName>E</MiddleName>
    <LastName>Abbas</LastName>
  </Person>
  <Person>
    <FirstName>Catherine</FirstName>
    <MiddleName>R.</MiddleName>
    <LastName>Abel</LastName>
  </Person>
  <Person>
    <FirstName>Kim</FirstName>
    <LastName>Abercrombie</LastName>
  </Person>
</Persons>
**/

SSIS Q001- How to get file names in a specified directory and its sub directory

Question - How to get the file name list from a directory and sub-directory ?

We can use script component in SSIS make a simple test.


public void Main()
        {
            string[] fileNames = Directory.GetFiles(@"D:\","*.*",SearchOption.AllDirectories);

            foreach (string filename in fileNames)
            {
                MessageBox.Show(filename);
            }
            // TODO: Add your code here
            Dts.TaskResult = (int)ScriptResults.Success;
        }

string[] fileNames = Directory.GetFiles(@"D:\","*.*",SearchOption.AllDirectories);


It gets all files under disk D including the sub directories. The file type can be any type - *.*. Of course, we change the filter of file type to be '*.txt' or '*.doc'. 


Run package,sometimes we will get an error 




See error information from 'Execute Results'



Error: System.Reflection.TargetInvocationException: Exception has been thrown by the target of an invocation. ---> System.UnauthorizedAccessException: Access to the path 'D:\System Volume Information' is denied.
   at System.IO.__Error.WinIOError(Int32 errorCode, String maybeFullPath)
   at System.IO.Directory.InternalGetFileDirectoryNames(String path, String userPathOriginal, String searchPattern, Boolean includeFiles, Boolean includeDirs, SearchOption searchOption)
   at System.IO.Directory.GetFiles(String path, String searchPattern, SearchOption searchOption)
   at ST_d1ce901e2c294acd880507ab2aac6c74.csproj.ScriptMain.Main()
   --- End of inner exception stack trace ---
   at System.RuntimeMethodHandle._InvokeMethodFast(Object target, Object[] arguments, SignatureStruct& sig, MethodAttributes methodAttributes, RuntimeTypeHandle typeOwner)
   at System.Reflection.RuntimeMethodInfo.Invoke(Object obj, BindingFlags invokeAttr, Binder binder, Object[] parameters, CultureInfo culture, Boolean skipVisibilityChecks)
   at System.Reflection.RuntimeMethodInfo.Invoke(Object obj, BindingFlags invokeAttr, Binder binder, Object[] parameters, CultureInfo culture)
   at System.RuntimeType.InvokeMember(String name, BindingFlags bindingFlags, Binder binder, Object target, Object[] providedArgs, ParameterModifier[] modifiers, CultureInfo culture, String[] namedParams)
   at Microsoft.SqlServer.Dts.Tasks.ScriptTask.VSTATaskScriptingEngine.ExecuteScript()



The key information is - Access to the path 'D:\System Volume Information' is denied.


We can search this information by Google,you can find many questions and solutions,but I just want to try my way. I think this hidden directory we can not access just because I didn't get the administration right to run this package. 


There're some system directories we really needn't to read,so if we want to load files from a directory and its sub-directories,we at least need to create a directory under the disk root.  



  public void Main()
        {
            string[] fileNames = Directory.GetFiles(@"D:\software\","*.*",SearchOption.AllDirectories);

            foreach (string filename in fileNames)
            {
                MessageBox.Show(filename);
            }
            // TODO: Add your code here
            Dts.TaskResult = (int)ScriptResults.Success;
        }

Test it, all files are shown in message box. 

Sunday, June 17, 2012

SSIS Step by Step 003 - Sync document status in specified folder

Requirement & Preface


In this post, we're going to do something about file processing. It doesn't like the Step by Step 002 ,Step 02 only output data to files.

Actually, this example should be a small project or a solution which covers the usage of many SSIS skills and components. It's a complicated demo, I will show you step by step. 

The requirements is:

1. Here's specified folder, for example -


We need to read this directory and record the information of all files into table. 
The information - File Name, Created Date, Modified Date, File Type, File Path, size etc.

2. If file is updated, the status will sync to database.
    For example, if the file A has been modified, the modified date will be a new date. When we run package, this file's information will be updated in database with new file size, modified date etc. 
3. If file is deleted, database will sync this information and won't show the file's information.
3. If a new file is added, database will sync this information and show the new file's information.
4. Also need to record historical information about files.

Some tech skills I won't show in this article, because they're shown in other posts before, please refer to 
SSIS Step by Step List to check them.

What's new in this post ?
1. How to get a file collection in script ?
2. How to get file's information like File Name, Created Date, Modified Date, File Type ?
3. How to use Look Up component to redirect Matched and NON-Matched records?
4. How to use Derived Column component to help to create a new column or replace some column value ?
5. How to design a simple documents-sync framework ? (Rome was not built in a day, the database design and SSIS package design will be updated until it meets all requirements, so the steps of this post will confuse you - how did you know this component should be here but not there ? This example is updated on and off for 5 times, I have to test it until it meet my requirements.)

Preparations:

1. Make sure you have created these tables in demo database - SSISDemoDB.
    You can get the schema of creating SSISDemoDB in Step 01 - SSIS Step by Step 001 - Load data from flat file
    
Scripts here:

USE SSISDemoDB
GO

IF OBJECT_ID('DocumentTypeMapping') IS NOT NULL
DROP TABLE DocumentTypeMapping

CREATE TABLE DocumentTypeMapping
(
      ID INT IDENTITY(1,1) PRIMARY KEY,
      FileExtension NVARCHAR(50) NOT NULL,
      FileTypeDescription NVARCHAR(100) NOT NULL
)

INSERT INTO DocumentTypeMapping VALUES
('dtsx','Integration Services Package'),
('database','Analysis Services Database'),
('dtproj','Integration Services project file'),
('user','Visual Studio Project User Options file'),
('dtsx','Integration Services Package'),
('txt','Text Document'),
('xlsx','Microsoft Excel Worksheet'),
('xlsx','Microsoft Excel Worksheet') ,
('vsd','Microsoft Visio Drawing'),
('doc','Microsoft Word Document 2003-2007  ')

IF OBJECT_ID('DocumentsStaging') IS NOT NULL
DROP TABLE DocumentsStaging

CREATE TABLE DocumentsStaging
(
      ID INT IDENTITY(1,1) PRIMARY KEY,
      NAME NVARCHAR(500) NOT NULL,
      FullPath NVARCHAR(2000) NOT NULL,
      CreatedDate DATETIME NOT NULL,
      FirstModifiedDate DATETIME NOT NULL,
      NewModifiedDate     DATETIME NOT NULL,
      FileType NVARCHAR(200) NOT NULL,
      Size BIGINT NOT NULL,
      BatchInsertTime DATETIME DEFAULT GETDATE() NOT NULL
)

IF OBJECT_ID('Documents') IS NOT NULL
DROP TABLE Documents

CREATE TABLE Documents
(
      ID INT IDENTITY(1,1) PRIMARY KEY,
      NAME NVARCHAR(500) NOT NULL,
      FullPath NVARCHAR(2000) NOT NULL,
      CreatedDate DATETIME NOT NULL,
      ModifiedDate DATETIME NOT NULL,
      FileType NVARCHAR(200) NOT NULL,
      Size INT NOT NULL,
      Operation NVARCHAR(50) NOT NULL
)

IF OBJECT_ID('DocumentsErrorExtension') IS NOT NULL
DROP TABLE DocumentsErrorExtension

CREATE TABLE DocumentsErrorExtension
(
      ID INT IDENTITY(1,1) PRIMARY KEY,
      NAME NVARCHAR(500) NOT NULL,
      FullPath NVARCHAR(2000) NOT NULL,
      FileType NVARCHAR(200) NOT NULL, 
)


Descriptions -

Table DocumentTypeMapping shows the description of specified file, usually we get file with its extension like - .doc , .txt, .ppt. We won't show user such information because the use may not understand what does '.ppt' or 'prot' mean, so configure this mapping before you composed your file processing package.

Table DocumentsStaging - It mostly like a historical table which will record all documents information in historically loading files from specified directory. 
For example, the first time we load folder 'Test', there're A,B,C file in this folder, we record these information when the first time we run the package. But at the second time when we run this package, we only find file B inside and a new file D.
Then we will record like this -

ID, File Name, BatchInsertTime
1   A          2011
2   B          2011
3   C          2011
4   B          2012
5   D          2012

Table Documents will only show the current existing files in specified folder.
So the first time when load files, it will show - 

ID, File Name, BatchInsertTime
1   A          2011
2   B          2011
3   C          2011

The second time it will show - 
ID, File Name, BatchInsertTime
1   B          2012
2   D          2012

Because system will think the file A and C have been deleted by user or their name has been changed. 

So if use want a report to show files information, table DocumentsStaging can show the history and table Documents can show current information of files.

Table DocumentsErrorExtension will record all mismatched file type. For example, there's no description for file extension - '.mdb'. If package process a new file with file extension .mdb, the information the this file won't be recorded in table Documents, it will be recored in DocumentsErrorExtension to let system admin know this file doesn't has a file type description, we need to insert a description for this file first.

SSIS Framework

Control Flow -


Data Flow - 


Package Variables -

Steps:

1. Create a package 'CH03_SyncDocuments' and defined package variables as above figure.
Package variable - Files. It will be used to store all file names from specified package variable - Directory.
Please assign your custom file folder path to 'Directory'

2. Drag a script component to Control Flow Pane, name it as 'SRC_GetFileNames', click to open it.

    ReadOnlyVariables - Directory (We have assigned a value to this variable in step 1)
    ReadWriteVariables - Files (All file names will be saved in this variable)

   
Click Edit Script and import System.IO in default class, add such codes.
        
        public void Main()
        {
            //Get the directory from user defined variable
            string directory = Dts.Variables["User::Directory"].Value.ToString();

            //Get all file names in specified directory
            string[] fileNames = Directory.GetFiles(directory);

            //Save to user defined variable
            Dts.Variables["User::Files"].Value = fileNames;

            Dts.TaskResult = (int)ScriptResults.Success;
        }


Save these changes and run package to make sure if can you see a green status of component. If script component becomes green, that means we can load all file names correctly from specified folder.

We often do this test when we finished a configuration to each component. 

3. Drag a loop foreach container as figure shows (Files Loop):




User::FilePath's value is from User::Files in a loop. 

4. Drag a script component with name 'SCR_LoadFileInformation' into loop container, edit this component 

    We use User::FilePath to load a file and read all properties of this file.    
    ReadWriteVariables are variables which will save all loaded properties from a file.


Edit script, import System.IO and try to understand such code inside -

        public void Main()
        {
            string fileName = Dts.Variables["User::FilePath"].Value.ToString();

            FileInfo file = new FileInfo(fileName);

            Dts.Variables["User::Name"].Value = file.Name;
            Dts.Variables["User::CreatedDate"].Value = file.CreationTime;
            Dts.Variables["User::ModifiedDate"].Value = file.LastWriteTime;
            Dts.Variables["User::FileType"].Value = file.Extension.Replace('.',' ').Trim();
            Dts.Variables["User::Size"].Value = (int)(file.Length/1024);
            Dts.Variables["User::FilePath"].Value = file.FullName;

            Dts.TaskResult = (int)ScriptResults.Success;
        }

How to pass a package variables and how to assign value to package variables ? You should know the answers.

Run this package, you should make sure all components are green finally.

5. In loop container, add Execute SQL Task with name EST_InsertFileInLoop, edit it.

    You should know how to build a OLE DB connection, I have mentioned it in previous posts.
    
    For SQL Statement, we need to build a SQL to insert all document information to a staging table, the parameter will use ? to instead of the real value, we will configure parameters in 'Parameter Mapping'.


We need to know which values we should to insert, then map them like below figure shows:

CREATE TABLE DocumentsStaging
(
ID INT IDENTITY(1,1) PRIMARY KEY,
NAME NVARCHAR(500) NOT NULL,                           -- User::Name
FullPath NVARCHAR(2000) NOT NULL,                        -- User::FilePath
CreatedDate DATETIME NOT NULL,                              -- User::CreatedDate
FirstModifiedDate DATETIME NOT NULL,                    -- User::ModifiedDate
NewModifiedDate  DATETIME NOT NULL,                 -- User::ModifiedDate
FileType NVARCHAR(200) NOT NULL,                         -- User::FileType
Size BIGINT NOT NULL,                                                  -- User::Size  (KB)
BatchInsertTime DATETIME DEFAULT GETDATE() NOT NULL  -- System Variable: System::StartTime
)

   
FirstModifiedDate DATETIME NOT NULL,                    -- User::ModifiedDate
NewModifiedDate   DATETIME NOT NULL,                 -- User::ModifiedDate


  • In Windows OS, we use C# can easily get a file's properties like created date and modified date.   
  • For example, I created a file on 05/01/2011.  Then I updated this file with new contents on 06/01/2011.
  • So the created date property of this file is  - 05/01/2011, the modified date property is 06/01/2011

  • But for this demo, we need to know which files are new added, which files are new updated, the basis is we judge if the created date is same as modified date. If created date = modified date, this file is new added, if created date <> modified date, we think this file has been updated. 

  • The problem for this demo is, if we load this file on 07/01/2011 at the first time, what conclusion we should make ? Is is a updated file or a new file. Actually it should be a new file to us, it may be edited several times before our package load it.

  • So I design two columns in tables, both columns store the value of Last Access Time of the file. The FirstModifiedDate doesn't mean the real first update time (06/01/2011) but means the last modified time when loading this file to staging table.



BatchInsertTime DATETIME DEFAULT GETDATE() NOT NULL  -- System Variable: System::StartTime

  •     We need to record which files are loaded per every time we run this package. 



You should understand the above steps doing:

  • Get all file names and save to an object which is used in loop container.
  • Each loop, we can use a file name to load a file and get properties of this file. All properties of a file are save in different package variables.
  • Map these package variables and compose SQL with these variable to insert a file into staging table.
  • When the loop ends, the information of all files are recorded. 


6. Build Execute SQL Task to delete error extension information and add a data flow task to process the main sync logic.

Execute SQL Task - EST_DeleteErrorType
Data Flow Task - UpdateDocuments

Edit EST_DeleteErrorTypes with existing OLE DB connection.
SQL Statement - 

DELETE FROM dbo.DocumentsErrorExtension
WHERE FileType IN(
      SELECT DISTINCT FileType
      FROM dbo.DocumentsStaging
)



The component is used to delete all error information from DocumentErrorExtension.  If the file type of a file doesn't map to mapping table, the information will be recorded in DocumentErrorExtension.

Go into data flow task - click 'UpdateDocuments', we 're going to design data flow task to sync documents.

7. Please see the data flow framework again.



The main logic is -

  • Get last batch documents from staging table to look up documents table.
  • NO Match - The current file doesn't exist in documents table, then insert new file information to documents (Also need to compare the file type, if the file type doesn't exist in file type mapping table, it will redirect to error extension table).
  • Match - The current file exists in documents table, need to update document information in document table.

8. Add OLE DB Source in data flow, new name is 'OLE_SRC_GetStaging', edit it.
 
    Getting the last batch of files from staging table as a source.


  SELECT *
  FROM dbo.DocumentsStaging
  WHERE BatchInsertTime = (SELECT MAX(BatchInsertTime) FROM dbo.DocumentsStaging)



     Pass all columns to next component.



9. Add a look up component - 'LKP_LookUpDocuments', edit this component.
 
    Choose 'Redirect rows to no match output'


Use 'Document' table as a look up table.



Available Input columns are from 'OLE_SRC_GetStaging', please see Step 8.
Available Lookup Columns are from table Documents.

Simply to say, there're two tables - DocumentsStaging and Documents. Just like the join in SQL, we use NAME and CreatedDate to see if files in staging exist in Documents.

There're two results here, Matched and NON-Matched, so two outputs are also generated in this component - Matched Output and NON-Matched Output, these outputs can be a source for next data flow component.

Drag 'Name' in input column to the 'Name' in lookup columns, same operation to CreatedDate.


10. For No Match Output, we're going to insert a new record. But we need to check if the file type exist in mapping table first.


   Drag a lookup component 'LKP_Look Up File Extension' and tie to 'LKP_LookUpDocuments', choose 'Lookup No Match Output' when message box shows up.

   Drag a OLE DB Components  CMD_UpdateDocuments' and tie to 'LKP_LookUpDocuments', choose 'Lookup Match Output' when message box shows up.

   Need to understand the source of Matched and Non-Matched components.

   For Non Matched component 'LKP_Look Up File Extension', its source columns are from the original source 'OLE_SRC_GetStaging'.
   For Matched component 'CMD_UpdateDocuments', its source columns are from the join columns of 'OLE_SRC_GetStaging' and       'LKP_LookUpDocuments' base on 'Name' and 'CreatedDate'.

11. Edit LKP_Look Up File Extension


Choose look up table - DocumentTypeMapping


File Type in input columns maps to File Extension in look up columns.

   
12. If mapped, that means the file type from staging table is correct type, otherwise, it should be incorrect file type which haven't been recorded in file mapping table.


13. If mapped, we get those records from staging table with the file type description and insert into table Documents.
      Drag a Derived Column component ''DC_InsertOperation' and tie to 'LKP_Look Up File Extension' by using 'Lookup Match Output', edit it.


A new column 'Operation' will be built with a value 'NEW', this column will be added into up stream input columns and will be passed together to next component as a source.

14. Add an OLE DB Destination component 'DEST_InsertNewDocuments' to connect to ''DC_InsertOperation', edit it.


Do mapping.


15. Drag another component 'DEST_NoMatchedFileExtension', edit it.



16. CMD_UpdateDocuments


 OLE DB Connection


Edit SQL Command


UPDATE dbo.Documents
SET FullPath = ?,
        ModifiedDate = ?,
        Size = ?,
        Operation = 'UPDATE'
WHERE NAME = ?
          AND     FileType = ?
          AND     CreatedDate = ?
          AND     UPPER(Operation) = 'NEW'
          AND     ModifiedDate <>  ?





 17. CMD_DeleteNONExistDoc
     DELETE FROM dbo.Documents
  WHERE Name NOT IN(
      SELECT NAME FROM
      dbo.DocumentsStaging
      WHERE BatchInsertTime =
      (
            SELECT MAX(BatchInsertTime)
            FROM dbo.DocumentsStaging 
      )
  )


18. Testing 




19. Add a new file 




20. Update contents.



Configurations!


Get all SSIS Step by Step, refer to  SIMON'S SSIS Step by Step

Please let me know if you have any questions about this blog.
Contact to me by email - simonlvpin@yahoo.com
Or Skype - simonlvpin