Computer Science > QUESTIONS & ANSWERS > Hadoop Certification Test Study Guide (All)
Hadoop Certification Test Study Guide For data in motion. Powered by Apache NiFi. 1) real-time - add, trace, adjust; 2) integrated - common input, output, transformation; 3) secure - security rules,... encryption, traceability; 4) adaptive - adapts data flow, scalable; if connection poor skinnies down data - >>>>Hortonworks Data Flow (HDF) A user-driven process of searching for patterns or specific items in a data set. Data discovery applications use visual tools such as geographical maps, pivot-tables, and heat-maps to make the process of finding patterns or specific items rapid and intuitive. Data discovery may leverage statistical and data mining. Ex. Web log analysis, online ad placement, claims notes mining - >>>>Data discovery Ex. sensor data ingest - >>>>ETL onboard Ex. individual driver histories - >>>>Active archive Perishable insights - >>>>Data in motion Historical insights - >>>>Data at rest Supports data discovery, single view, predictive analytics - >>>>Actionable intelligence A Single View application aggregates data from multiple sources into a central repository to create a single view of anything — of customers, inventory, systems - >>>>Single view Offers the leading platform for Operational Intelligence. It enables the curious to look closely at what others ignore—machine data—and find what others never see: insights that can help make your company more productive, profitable, competitive and secure - >>>>Splunk An open source big data processing framework built around speed, ease of use, and sophisticated analytics. It was originally developed in 2009 in UC Berkeley's AMPLab, and open sourced in 2010 as an Apache project - >>>>Apache Splunk Real-time event processing for sensor and business activity monitoring. A free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language. Ingests millions of events per second. Manage with Ambari. Horizontally scalable. Fixed, low latency and continuous processing for very high frequency streaming data. - >>>>Apache Storm Data operating system. Cluster resource management. 2013 - includes batch, interactive and realtime. At core of Hortonworks Data Platform (HDP) for data at rest. Centralized platform for: 1) operations - cluster management, one data lake or clusters; 2) governance - data lifecycle mgt, modeling with metadata, lineage capability 3) security - roles or data tags, encryption at rest and in motion, authentication. Includes data functions for: batch, machine learning, search, interactive, streaming - >>>>YARN SQL:2011 for analytics - >>>>Hive on YARN Data at rest. Powered by Open Enterprise Hadoop. 1) Open - open source; 2) Central - Yarn at core; 3) Interoperable - existing technology, skills; 4) Ready - enterprise-ready re operations, governance, security; dev efforts include: 1) data management; 2) data access; 3) governance and integration; 4) operations; 5) security - >>>>Hortonworks Data Platforms (HDP) An open source cluster computing framework originall [Show More]
Last updated: 1 year ago
Preview 1 out of 21 pages
Buy this document to get the full access instantly
Instant Download Access after purchase
Add to cartInstant download
We Accept:
Connected school, study & course
About the document
Uploaded On
Oct 30, 2022
Number of pages
21
Written in
This document has been written for:
Uploaded
Oct 30, 2022
Downloads
0
Views
75
In Browsegrades, a student can earn by offering help to other student. Students can help other students with materials by upploading their notes and earn money.
We're available through e-mail, Twitter, Facebook, and live chat.
FAQ
Questions? Leave a message!
Copyright © Browsegrades · High quality services·