Database Reference
In-Depth Information
TIP
If you are using SSIS 2012, you can use the ODBC Source instead of the
ADO.NET
Source to gain a little performance. The interface is almost
identical, and setting it up is exactly the same as the
ADO.NET
Source
except that you will pick an ODBC connection manager rather than an
ADO.NET
connection manager.
Figure 8.3
ADO.NET
Source.
For the best SSIS experience, make sure that the query being run against
Hive returns the smallest number of rows and columns possible. Hive tables
can be quite large, so it's important for package performance that you
minimize the amount of data retrieved by SSIS. One way to do this is to not
use
SELECT * FROM
table in the queries; instead, list the specific columns
that you need to use. Also, you will generally want to apply some sort of
WHERE
clause to the query to reduce the number of rows returned.