Big Data Cloudera HUE

Question Preview:

Task: Practical Big Data Cloudera HUE exercise with two parts:

Part 1 - Working with a large dataset: BDMS processing and creating reports
Part 2 - Migrate the created setup away from Hive to the NoSQL scheme offered by Camel-MongoDB

Required expert skills for the two tasks: practical experience with Cloudera HUE, importing data in HUE, working within the Hive environment in Cloudera, scripting in Pig Latin, developing simple reports for the given task, and database migration knowledge.

Resource: Use the Cloudera HUE Demo account for this exercise. You can access the HUE Demo account here: http://demo.gethue.com/hue/editor/?type=hive

Requirements: Create a 2,000-word step-by-step documentation, with detailed screenshots of each step, for the practical exercise described below. The task is divided into two parts: Part 1 should cover about 1,000 words and Part 2 also about 1,000 words. Cite and reference all sources using the Harvard Liverpool Referencing System (at least 5 references).

Task Scenario: You have been engaged as a Data Advisor working with Advanced Data Science Services (ADSS). ADSS has just been awarded a contract by a government department (the Department of Environment) to help with the management, data mining and visualization of atmospheric emissions (and pollution) data gathered by various borough and county environment monitoring units. You need to assist this project, and you will be required to carry out a number of tasks as described below using the Hadoop framework and tools.

Preparation for your two tasks:

Prepare your demo account: Create a new directory called ADDS_Test for your task in the HUE Hive Demo environment. Screenshot: New directory called ADDS_Test

Import the data for analysis into your directory in the demo account. The data for the import can be found at https://data.london.gov.uk/dataset/london-atmospheric-emissions-inventory-2013?resource=0fa6a83a-1529-48a7-890d-414a2f52b0e6. (It relates to the year 2013.) The file is called: “3 - Detailed Road Transport - LAEI2013_MajorRoads_EmissionsbyLink_2013 (332.19 MB)”

After the preparation, your two tasks in detail: Create a step-by-step documentation with detailed screenshots of each step, including the detailed syntax, for the practical exercise described below.

Part 1: Analyze your data. For example, use relevant Hive DML statements, scripts and summary functions (e.g. max, min, count or avg) and generate at least two reports that summarize the data in the tables in an insightful way. Examples for your reports could be:
1. Who creates the most emissions (unit = tonne/year): Motorcycle, Taxi or Car?
2. Which road creates the most pollution (hint: e.g. the road with the largest length)?
However, feel free to create another appropriate insight report for illustration. Explain your rationale for producing those particular report summaries.

Part 2:
1. Migrate the created setup away from Hive to the NoSQL scheme offered by Camel-MongoDB. Explain the prime considerations and actions that need to be taken for such a migration.
2. It is intended to convert the static visualisation dashboards previously created to live ones via the use of streaming schemes offered by Spark and Kafka. Outline what steps would need to be taken to accomplish this.
3. Explain how the use of Sentry and/or IAM (Identity Access Manager) for AWS may be used to help secure the cluster and the Hadoop deployment in an enterprise environment.


Solution Preview

Part 1:

1- We start by uploading our file to HDFS using the HUE interface. Since the file is in XLSX format, I converted it to CSV and then imported it into HDFS. Below is the screenshot of the file in my HDFS folder named ‘co2’.
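The solution performs the upload through the HUE interface. On a cluster with shell access, the same step could be scripted instead; the following is only a sketch under that assumption. The file name `emissions.csv` and the target directory `/user/demo/co2` are hypothetical placeholders, and the helper names are made up for this illustration.

```python
import subprocess
from typing import List


def build_upload_commands(local_csv: str, hdfs_dir: str) -> List[List[str]]:
    """Return the HDFS shell commands that create the target folder and copy the CSV into it."""
    return [
        # Create the target directory (and any missing parents) if it does not exist yet.
        ["hdfs", "dfs", "-mkdir", "-p", hdfs_dir],
        # Copy the local CSV into that directory, overwriting an earlier upload.
        ["hdfs", "dfs", "-put", "-f", local_csv, hdfs_dir],
    ]


def upload(local_csv: str, hdfs_dir: str) -> None:
    """Run the commands in order; requires the `hdfs` client on the PATH."""
    for command in build_upload_commands(local_csv, hdfs_dir):
        subprocess.run(command, check=True)


if __name__ == "__main__":
    # Hypothetical file name and HDFS path, shown only to illustrate the call.
    for command in build_upload_commands("emissions.csv", "/user/demo/co2"):
        print(" ".join(command))
```

Keeping command construction separate from execution makes it possible to inspect the commands before running them against a real cluster.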

2- Next, to load the file into a Hive table, we need to create a Hive table schema. The Hive table was created at the same location as the CSV file so that the table picks up the data. Below is the screenshot of the Hive table code.
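The table definition itself is only available as a screenshot, so the following is a hedged sketch of what such a schema could look like, not the author's actual code. It assumes an external table with every column typed as STRING, and the table name, header names and HDFS path are hypothetical; the real column list would come from the CSV header.

```python
import re
from typing import Sequence


def hive_identifier(header: str) -> str:
    """Turn a CSV header into a lower-case name with runs of other characters replaced by '_'."""
    name = re.sub(r"[^0-9a-zA-Z]+", "_", header.strip()).strip("_").lower()
    return name or "col"


def build_create_table(table: str, headers: Sequence[str], location: str) -> str:
    """Build a CREATE EXTERNAL TABLE statement that points at the folder holding the CSV."""
    # Assumption: all columns are STRING; numeric columns would be cast in later queries.
    columns = ",\n".join(f"  `{hive_identifier(h)}` STRING" for h in headers)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n"
        f"{columns}\n"
        ")\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\n"
        "STORED AS TEXTFILE\n"
        f"LOCATION '{location}'\n"
        "TBLPROPERTIES ('skip.header.line.count'='1')"
    )


if __name__ == "__main__":
    # Hypothetical headers; replace them with the first line of the real CSV.
    sample_headers = ["Road Name", "Pollutant", "Emissions (tonnes/year)"]
    print(build_create_table("emissions", sample_headers, "/user/demo/co2"))
```

Pointing `LOCATION` at the folder that already contains the CSV mirrors the approach described in step 2: no separate load step is needed because Hive reads the files in place. Note that a plain comma delimiter does not handle quoted fields containing commas.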

3- Then, using Hive DML, we checked which of motorcycle, taxi and car produces the most CO2. Cars were found to produce the most CO2 of the three. Below are the code and the output.
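The original query and its output are not reproduced in this preview. As an illustration only, the comparison could be expressed as a single aggregate query like the one sketched here. The sketch assumes a hypothetical table layout with one numeric column per vehicle type plus a `pollutant` column, and it runs the query against an in-memory SQLite table with invented sample rows so that it can be tried without a cluster. The sample values are made up and say nothing about the real dataset.

```python
import sqlite3

# Aggregate query over the assumed layout; it uses only SUM and WHERE,
# which are written the same way in HiveQL.
QUERY = """
SELECT SUM(motorcycle) AS motorcycle,
       SUM(taxi)       AS taxi,
       SUM(car)        AS car
FROM emissions
WHERE pollutant = 'CO2'
"""


def top_emitter(rows):
    """Load rows into a temporary table, run QUERY, and return (winner, totals)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE emissions (pollutant TEXT, motorcycle REAL, taxi REAL, car REAL)"
    )
    conn.executemany("INSERT INTO emissions VALUES (?, ?, ?, ?)", rows)
    cursor = conn.execute(QUERY)
    names = [column[0] for column in cursor.description]
    totals = dict(zip(names, cursor.fetchone()))
    conn.close()
    # The vehicle type with the largest summed emissions wins.
    return max(totals, key=totals.get), totals


# Invented sample rows: (pollutant, motorcycle, taxi, car) in tonnes/year.
SAMPLE_ROWS = [
    ("CO2", 1.0, 2.0, 10.0),
    ("CO2", 0.5, 3.0, 20.0),
    ("NOx", 9.0, 9.0, 9.0),
]

if __name__ == "__main__":
    winner, totals = top_emitter(SAMPLE_ROWS)
    print(winner, totals)
```

If the real table stores one row per vehicle type rather than one column per type, the equivalent query would group by the vehicle-type column and order by the summed emissions instead.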
