http://biglogictech.com/ : Hadoop & MongoDB Classes
http://biglogictech.com/training/big-data-analytics/hadoop-architect-hadoop-consultant/ http://biglogictech.com/training/no-sql-databases/mongodb/ Our Popular Training Programs Big Data Expert Enroll Now MongoDB Expert Enroll Now
[MongoDB-Hadoop]: Connectivity testing
Install Hadoop CDH 4/5, Hive, Pig, Java & MongoDB and set environment variables as below. [root@dbversity.com ~]# cat ~/.bashrc | grep “export” export PATH=$PATH:/opt/mongodb/bin export JAVA_HOME=/usr/java/jdk1.8.0_05 export PATH=$JAVA_HOME/bin:$PATH export HADOOP_HOME=/hadoop export PATH=$HADOOP_HOME/bin:$PATH export HIVE_HOME=/hadoop/hive-0.12.0-cdh5.0.0 export PATH=$HIVE_HOME/bin:$PATH export PIG_HOME=/hadoop/pig-0.12.0-cdh5.0.0 export PATH=$PIG_HOME/bin:$PATH…
Hadoop and MongoDB
What is Hadoop? Hadoop is a software technology designed for storing and processing large volumes of data using a cluster of commodity servers and commodity storage. Hadoop is an open-source Apache project started in 2005 by engineers at Yahoo. It…
Uninstallation process of CDH (Cloudera Hadoop packages)
Here’are the uninstallation steps 1. Stop ALL services: a. service cloudera-scm-server stop b. service cloudera-scm-server-db stop c. service cloudera-scm-agent hard_stop 2. Uninstall the CDH and Cloudera Manager packages: [srinivas@dbversity.com ~]# hadoop version Hadoop 2.0.0-cdh4.5.0 Subversion file:///var/lib/jenkins/workspace/CDH4.5.0-Packaging-Hadoop/build/cdh4/hadoop/2.0.0-cdh4.5.0/source/hadoop-common-project/hadoop-common -r 30821ec616ee7a21ee8447949b7c6208a8f1e7d8 Compiled by…
Hadoop Streaming
Hadoop Streaming —————- Hadoop streaming is a utility that comes with the Hadoop distribution. The utility allows you to create and run Map/Reduce jobs with any executable or script as the mapper and/or the reducer. For example: $HADOOP_HOME/bin/hadoop jar $HADOOP_HOME/hadoop-streaming.jar…
Hadoop Archives & MR
Overview ——– Hadoop archives are special format archives. A Hadoop archive maps to a file system directory. A Hadoop archive always has a *.har extension. A Hadoop archive directory contains metadata (in the form of _index and _masterindex) and data…
MongoDB Connector for Hadoop
Purpose The MongoDB Connector for Hadoop is a library which allows MongoDB (or backup files in its data format, BSON) to be used as an input source, or output destination, for Hadoop MapReduce tasks. It is designed to allow greater…
Hadoop.In Excel.
Explore and analyze Big Data, without IT overhead https://datanitro.com/hadoop_in_excel.html
Apache Hadoop 2 & Apache Hadoop YARN videos
FYI Session Title Watch View Unlocking Hadoop’s Potential Video Enterprise Hadoop for Pools, Ponds, Clouds and Beyond Video Apache Hadoop YARN: Present and Future Video Slides YARN: The Key to Overcoming the Challenges of Broad-based Hadoop Adoption Video Slides One…
Hadoop Cluster set-up document
Running Hadoop on RHEL Linux (Multi-Node Cluster) Here’s we’ll see how to set-up multi-node Apache Hadoop cluster backed by the Hadoop Distributed File System (HDFS), running on RHEL Linux. Hadoop is a framework written in Java for running applications on…