Big Data is the data which can not be processed by traditional database systems i.e.Mysql,Sql. Big data consist of data in the structured ie.Rows and Coloumns format ,semi-structured i.e.XML records and Unstructured format i.e.Text records,Twitter Comments.
Hadoop is an software framework for writing and running distributed applications that processes large amount of data.
Hadoop Distributed File System is a filesystem designed for large-scale distributed data processing under framework such as Mapreduce.
Hadoop framework consist of Storage area known as Hadoop Distributed File System(HDFS) and processing part known as MapReduce programming model.
Mapreduce is Data processing model consist of data processing primitives called Mapper and Reducer.
Hadoop mainly uses four input formats-FileInput Format, KeyValueTextInput Format, TextInput Format, NLineInput Format.
Hadoop works more effectively with single large file than number of smaller one.
Hadoop supports chaining MapReduce programs together to form a bigger job.We will explore various joining technique in hadoop
for simultaneously processing multiple datasets.Many complex tasks need to be broken down into simpler subtasks,each accomplished by an individual Mapreduce jobs.
Hadoop is best used as a write-once,Read-many-times type of datastore. With the help of hadoop large dataset will be divided into smaller (64 or 128 MB) blocks that are spread among many machines in the clusters via Hadoop Distributed File System.
Approachable-Hadoop runs on Huge clusters of appropriate Hardware apparatus.
Powerful-Because it is intentional to run on clusters of appropriate Hardware apparatus ,Hadoop is architect with the presumption of repeated hardware malfunctions.It can handle most of such failures.
Resizable-Hadoop mearsures sequentially to hold large data by including more nodes to the cluster.
Simple-Hadoop allows users to speedly write well-organized parallel codes.
Who can do this course - Any Graduate, Non IT / IT professionals
Total Duration - 3 Months (With real time project)
Placement - 100% Job Assurance
Training Topics -