Text mining with term extraction and term lookup transformations
In this recipe, you will see how text mining works. You will use a text file with some blogs as a source, extract terms from them, and then look up terms in them.
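Before building the package, it can help to see the two ideas in miniature. The sketch below is an illustration only, not what SSIS does internally. The Term Extraction transformation performs linguistic analysis to find nouns and noun phrases, while this example simply counts single words. The sample texts, the stop-word list, and the function names `tokenize`, `extract_terms`, and `lookup_terms` are all hypothetical stand-ins.

```python
import re
from collections import Counter

# Hypothetical sample rows standing in for the contents of Blogs.txt.
blogs = {
    1: "Data mining finds patterns in data. Text mining applies data mining to text.",
    2: "Integration Services can load text files and run text mining transformations.",
}

# Assumed, deliberately tiny stop-word list for this illustration.
STOP_WORDS = {"a", "an", "and", "can", "in", "of", "the", "to"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def extract_terms(documents, min_frequency=2):
    """Term extraction (simplified): keep words that occur at least
    min_frequency times across all documents."""
    counts = Counter()
    for text in documents.values():
        counts.update(t for t in tokenize(text) if t not in STOP_WORDS)
    return {term: n for term, n in counts.items() if n >= min_frequency}


def lookup_terms(documents, terms):
    """Term lookup (simplified): for each document, count how often
    each known term occurs and return (doc_id, term, count) rows."""
    rows = []
    for doc_id, text in documents.items():
        counts = Counter(tokenize(text))
        for term in terms:
            if counts[term] > 0:
                rows.append((doc_id, term, counts[term]))
    return rows


terms = extract_terms(blogs)
for row in lookup_terms(blogs, terms):
    print(row)
```

The first function builds a list of terms from all the texts together. The second takes a list of terms and reports where each one appears, document by document. That is roughly the division of labor between the two data flow tasks in this recipe.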
Getting ready
You need to have the Blogs.txt file available. It is included in the code download for this book.
How to do it...
- In SSDT, add a new package to the Chapter08 SSIS project. Name it TermExtractionLookup.
- On the Control Flow tab in the Package Designer, add two data flow tasks. Name them TermExtraction and TermLookup, and connect them with the green arrow. The term extraction task should be first.
- Click the Data Flow tab. Select the TermExtraction task. Prepare the data flow for this task.
- Add a flat file source. Name it ImportBlogs. Set the properties for this source:
  - Create a new connection manager called BlogsTxt, whose source file is C:\SSIS2016Cookbook\Blogs.txt. Select the Column names in the first data row checkbox.
  - Click Columns ....