Microsoft Perform Data Engineering on Microsoft Azure HDInsight - 070-775 Exam Practice Test
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You need to deploy an enterprise data warehouse that will support in-memory analytics. The data warehouse must support connections that use the Microsoft Hive ODBC Driver and Beeline. The data warehouse will be managed by using Apache Amrabi only.
What should you do?
You need to deploy an enterprise data warehouse that will support in-memory analytics. The data warehouse must support connections that use the Microsoft Hive ODBC Driver and Beeline. The data warehouse will be managed by using Apache Amrabi only.
What should you do?
Correct Answer: D
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You have an Apache Interactive Hive cluster in Azure HDInsight. The cluster has 12 processors and 96 GB of RAM. The YARN container size is set to 2 GB and the Tez container size is 3 GB.
You configure one Tez container per processor.
You are performing map joints between a 2-GB dimension table and a 96-GB fact table.
You experience slow performance due to an inadequate utilization of the available resources.
You need to ensure that the map joins are used.
Which two settings should you configure? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
You configure one Tez container per processor.
You are performing map joints between a 2-GB dimension table and a 96-GB fact table.
You experience slow performance due to an inadequate utilization of the available resources.
You need to ensure that the map joins are used.
Which two settings should you configure? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
Correct Answer: C
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this sections, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have an Apache Pig table named Sales in Apache HCatalog.
You need to make the data in the table accessible from Apache Pig.
Solution: You use the following script.

Does this meet the goal?
After you answer a question in this sections, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have an Apache Pig table named Sales in Apache HCatalog.
You need to make the data in the table accessible from Apache Pig.
Solution: You use the following script.

Does this meet the goal?
Correct Answer: A
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You have an Apache Spark cluster in Azure HDInsight.
Users report that Spark jobs take longer than expected to complete.
You need to reduce the amount of time it takes for the Spark jobs to complete.
What should you do?
Users report that Spark jobs take longer than expected to complete.
You need to reduce the amount of time it takes for the Spark jobs to complete.
What should you do?
Correct Answer: D
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
HOTSPOT
You install the Microsoft Hive ODBC Driver on a computer that runs Windows 10 and has the 64-bit version of Microsoft Office 2016 installed.
You deploy a new Apache Interactive Hive cluster in Azure HDInsight. The cluster is hosted at myHDICluster.azurehdinsight.net and contains a Hive table name hivesampletable that has 200,000 rows.
You plan to use HiveQL exclusively for the queries. The queries will return from 6,000 to 10,000 rows 90 percent of the time.
You need to configure a data source to ensure that you can use Microsoft Excel to access the dat a. The solution must ensure that the Hive queries execute as quickly as possible.
How should you configure the Advanced Options from the Microsoft Hive ODBC Driver DSN Setup dialog box? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

You install the Microsoft Hive ODBC Driver on a computer that runs Windows 10 and has the 64-bit version of Microsoft Office 2016 installed.
You deploy a new Apache Interactive Hive cluster in Azure HDInsight. The cluster is hosted at myHDICluster.azurehdinsight.net and contains a Hive table name hivesampletable that has 200,000 rows.
You plan to use HiveQL exclusively for the queries. The queries will return from 6,000 to 10,000 rows 90 percent of the time.
You need to configure a data source to ensure that you can use Microsoft Excel to access the dat a. The solution must ensure that the Hive queries execute as quickly as possible.
How should you configure the Advanced Options from the Microsoft Hive ODBC Driver DSN Setup dialog box? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Correct Answer:

References: https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-connect-excel-hive-odbc- driver
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You need to deploy a NoSQL database to an HDInsight cluster. You will manage the server that host the database by using Remote Desktop. The database must use the key/value pair format in a columnar model.
What should you do?
You need to deploy a NoSQL database to an HDInsight cluster. You will manage the server that host the database by using Remote Desktop. The database must use the key/value pair format in a columnar model.
What should you do?
Correct Answer: B
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You have an Azure HDlnsight cluster.
You need to build a solution to ingest real-time streaming data into nonrelational distributed database.
What should you use to build the solution?
You need to build a solution to ingest real-time streaming data into nonrelational distributed database.
What should you use to build the solution?
Correct Answer: B
Vote an answer