Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
Arrow right icon
timer SALE ENDS IN
0 Days
:
00 Hours
:
00 Minutes
:
00 Seconds
Arrow up icon
GO TO TOP
Business Intelligence with MicroStrategy Cookbook

You're reading from   Business Intelligence with MicroStrategy Cookbook Over 90 practical, hands-on recipes to help you build your MicroStrategy business intelligence project, including more than a 100 screencasts with this book and ebook

Arrow left icon
Product type Paperback
Published in Oct 2013
Publisher Packt
ISBN-13 9781782179757
Length 356 pages
Edition 1st Edition
Arrow right icon
Author (1):
Arrow left icon
Davide Moraschi Davide Moraschi
Author Profile Icon Davide Moraschi
Davide Moraschi
Arrow right icon
View More author details
Toc

Table of Contents (25) Chapters Close

Business Intelligence with MicroStrategy Cookbook
Credits
Foreword
About the Author
About the Reviewers
www.PacktPub.com
Preface
1. Getting Started with MicroStrategy FREE CHAPTER 2. The First Steps in a MicroStrategy Project 3. Schema Objects – Attributes 4. Objects – Facts and Metrics 5. Data Display and Manipulation – Reports 6. Data Analysis and Visualization – Graphs 7. Analysis on the Web – Documents and Dashboards 8. Dynamic Selection with Filters and Prompts 9. Mobile BI for Developers 10. Mobile BI for Users 11. Consolidations, Custom Groups, and Transformations 12. In-Memory Cubes and Visual Insight 13. MicroStrategy Express Solution to Exercises Where to Look for Information Cloudera Hadoop HP Vertica Index

Connecting to a Hadoop database


To show how we can connect to a Hadoop database, I will use two virtual machines: one with MicroStrategy Suite and the second with Cloudera Hadoop distribution, specifically, a virtual appliance that is available for download from their website.

The configuration of the Hadoop cluster is out of scope; moreover, I am not a Hadoop expert. I'll simply give some hints, feel free to use any other configuration/vendor, the procedure and ODBC parameters should be similar.

Getting ready

Start by going to http://at5.us/AppAU1

The Cloudera VM download is almost 3 GB (cloudera-quickstart-vm-4.3.0-vmware.tar.gz) and features the CH4 version. After unpacking the archive, you'll find a cloudera-quickstart-vm-4.3.0-vmware.ovf file that can be opened with VMware, see screen capture:

Accept the defaults and click on Import to generate the cloudera-quickstart-vm-4.3.0-vmware virtual machine.

Before starting the Cloudera appliance, change the network card settings from NAT to Bridged since we need to access the database from another VM:

Leave the rest of the parameters, as per the default, and start the machine.

After a while, you'll be presented with a graphical interface of Centos Linux. If the network has started correctly, the machine should have received an IP address from your network DHCP. We need a fixed rather than dynamic address in the Hadoop VM, so:

  1. Open the System | Preferences | Network Connections menu.

  2. Select the name of your card (should be something like Auto eth1) and click on Edit….

  3. Move to the IPv4 Settings tab and change the Method from Automatic (DHCP) to Manual.

  4. Click on the Add button to create a new address. Ask your network administrator for details here and fill Address, Netmask, and Gateway.

  5. Click on Apply… and when prompted type the root password cloudera and click on Authenticate. Then click on Close.

  6. Check if the change was successful by opening a Terminal window (Applications | System Tools | Terminal) and issue the ifconfig command, the answer should include the address that you typed in step 4.

  7. From the MicroStrategy Suite virtual machine, test if you can ping the Cloudera VM.

    When we first start Hadoop, there are no tables in the database, so we create the samples:

  8. In the Cloudera virtual machine, from the main page in Firefox open Cloudera Manager, click on I Agree in the Information Assurance Policy dialog.

  9. Log in with username admin and password admin.

  10. Look for a line with a service named oozie1, notice that it is stopped. Click on the Actions button and select Start….

  11. Confirm with the Start button in the dialog. A Status window will pop up, wait until the Progress is reported as Finished and close it.

  12. Now click on the Hue button in the bookmarks toolbar.

  13. Sign up with username admin and password admin, you are now in the Hue home page.

  14. Click on the first button in the blue Hue toolbar (tool tip: About Hue) to go to the quick Start Wizard.

  15. Click on the Next button to go to Step 2: Examples tab.

  16. Click on Beeswax (Hive UI) and wait until a message over the toolbar says Examples refreshed.

  17. Now in the Hue toolbar, click on the seventh button from the left (tool tip: Metastore Manager), you will see the default database with two tables: sample_07 and sample_08.

  18. Enable the checkbox of sample_08 and click on the Browse Data button. After a while the Results tab shows a grid with data. So far so good.

  19. We now go back to the Cloudera Manager to start the Impala service. Click on the Cloudera Manager bookmark button.

  20. In the Impala1 row, open the Actions menu and choose Start…, then confirm Start.

  21. Wait until the Progress says Finished, then click on Close in the command details window.

  22. Go back to Hue and click on the fourth button on the toolbar (tool tip: Cloudera Impala (TM) Query UI).

  23. In the Query Editor text area, type select * from sample_08 and click on Execute to see the table content.

Next, we open the MicroStrategy virtual machine and download the 32-bit Cloudera ODBC Driver for Apache Hive, Version 2.0 from http://at5.us/AppAU2.

Download the ClouderaHiveODBCSetup_v2_00.exe file and save it in C:\install.

How to do it...

We install the ODBC driver:

  1. Run C:\install\ClouderaHiveODBCSetup_v2_00.exe and click on the Next button until you reach Finish at the end of the setup, accepting every default.

  2. Go to Start | All Programs | Administrative Tools | Data Sources (ODBC) to open the 32-bit ODBC Data Source Administrator (if you're on 64-bit Windows, it's in the SysWOW64 folder).

  3. Click on System DSN and hit the Add… button.

  4. Select Cloudera ODBC Driver for Apache Hive and click on Finish.

  5. Fill the Hive ODBC DSN Configuration with these case-sensitive parameters (change the Host IP according to the address used in step 4 of the Getting ready section):

    • Data Source Name: Cloudera VM

    • Host: 192.168.1.40

    • Port: 21050

    • Database: default

    • Type: HS2NoSasl

  6. Click on OK and then on OK again to close the ODBC Data Source Administrator.

  7. Now open the MicroStrategy Desktop application and log in with administrator and the corresponding password.

  8. Right-click on MicroStrategy Analytics Modules and select Create New Project….

  9. Click on the Create project button and name it HADOOP, uncheck Enable Change Journal for this project and click on OK.

  10. When the wizard finishes creating the project click on Select tables from the Warehouse Catalog and hit the button labeled New….

  11. Click on Next and type Cloudera VM in the Name textbox of the Database Instance Definition window.

  12. In this same window, open the Database type combobox and scroll down until you find Generic DBMS. Click on Next.

  13. In Local system ODBC data sources, pick Cloudera VM and type admin in both Database login and Password textboxes.

  14. Click on Next, then on Finish, and then on OK.

  15. When a Warehouse Catalog Browser error appears, click on Yes.

  16. In the Warehouse Catalog Options window, click on Edit… on the right below Cloudera VM.

  17. Select the Advanced tab and enable the radio button labeled Use 2.0 ODBC calls in the ODBC Version group.

  18. Click on OK. Now select the category Catalog | Read Settings in the left tree and enable the first radio button labeled Use standard ODBC calls to obtain the database catalog.

  19. Click on OK to close this window. When the Warehouse Catalog window appears, click on the lightning button (tool tip: Read the Warehouse Catalog) to refresh the list of available tables.

  20. Pick sample_08 and move it to the right of the shopping cart. Then right-click on it and choose Import Prefix.

  21. Click on Save and Close and then on OK twice to close the Project Creation Assistant.

  22. You can now open the project and update the schema.

From here, the procedure to create objects is the same as in any other project:

  1. Go to the Schema Objects | Attributes folder, and create a new Job attribute with these columns:

    • ID:

      Table: sample_08

      Column: code

    • DESC:

      Table: sample_08

      Column: description

  2. Go to the Fact folder and create a new Salary fact with salary column. Update the schema.

  3. Go to the Public Objects | Metrics folder and create a new Salary metric based on the Salary fact with Sum as aggregation function.

  4. Go to My Personal Objects | My Reports and create a new report with the Job attribute and the Salary metric:

There you go; you just created your first Hadoop report.

How it works...

Executing Hadoop reports is no different from running any other standard DBMS reports. The ODBC driver handles the communication with Cloudera machine and Impala manages the creation of jobs to retrieve data. From MicroStrategy perspective, it is just another SELECT query that returns a dataset.

There's more...

Impala and Hive do not support the whole set of ANSI SQL syntax, so in some cases you may receive an error if a specific feature is not implemented:

See the Cloudera documentation for details.

Note

You can watch screencasts of this recipe at:

See also

  • The Connecting to a Vertica database recipe in Appendix D, HP Vertica

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime