Shazam It! Music Recognition Algorithms, Fingerprinting, and Processing
You hear a familiar song in the club or the restaurant. You listened to this song a thousand times long ago, and the sentimentality of the song really touches your heart. You desperately want to hear it tomorrow, but you…
Ractive.js – Web Apps Made Easy
In today’s rapidly proliferating landscape of JavaScript frameworks and libraries, selecting the one on which you want to base your development can be quite a challenge. After all, once you go down the path of using a particular framework, migrating your code…
Predicting Likes: Inside A Simple Recommendation Engine’s Algorithms
A recommendation engine (sometimes referred to as a recommender system) is a tool that lets algorithm developers predict what a user may or may not like among a list of given items. Recommendation engines are a pretty interesting alternative to search fields,…
Growing Growth: Perform Your Own Cohort Analysis with This Open Source Code
Cohort analysis, retention, and churn are some of the key metrics in company building. But this isn’t just another article about cohort analysis. If you’re a seasoned data scientist who already knows the importance of the topic and wants to skip the introduction, you can jump to…
Introduction to Apache Spark with Examples and Use Cases
I first heard of Spark in late 2013 when I became interested in Scala, the language in which Spark is written. Some time later, I did a fun data science project trying to predict survival on the Titanic. This turned out…
http://biglogictech.com/ : Hadoop & MongoDB Classes
http://biglogictech.com/training/big-data-analytics/hadoop-architect-hadoop-consultant/ http://biglogictech.com/training/no-sql-databases/mongodb/ Our Popular Training Programs: Big Data Expert, MongoDB Expert
[MongoDB-Hadoop]: Connectivity testing
Install Hadoop CDH 4/5, Hive, Pig, Java & MongoDB and set environment variables as below.
[root@dbversity.com ~]# cat ~/.bashrc | grep "export"
export PATH=$PATH:/opt/mongodb/bin
export JAVA_HOME=/usr/java/jdk1.8.0_05
export PATH=$JAVA_HOME/bin:$PATH
export HADOOP_HOME=/hadoop
export PATH=$HADOOP_HOME/bin:$PATH
export HIVE_HOME=/hadoop/hive-0.12.0-cdh5.0.0
export PATH=$HIVE_HOME/bin:$PATH
export PIG_HOME=/hadoop/pig-0.12.0-cdh5.0.0
export PATH=$PIG_HOME/bin:$PATH
…
Hadoop and MongoDB
What is Hadoop? Hadoop is a software technology designed for storing and processing large volumes of data using a cluster of commodity servers and commodity storage. Hadoop is an open-source Apache project started in 2005 by engineers at Yahoo. It…
Intro to Spring Data
The spring-data project offers abstraction and unified data access for a variety of data sources. Category Sub-project Relational Databases JPA Spring Data JPA – Simplifies the development of creating a JPA-based data access layer JDBC Extensions Support for Oracle RAC, Advanced Queuing,…
Uninstallation process of CDH (Cloudera Hadoop packages)
Here are the uninstallation steps:
1. Stop ALL services:
   a. service cloudera-scm-server stop
   b. service cloudera-scm-server-db stop
   c. service cloudera-scm-agent hard_stop
2. Uninstall the CDH and Cloudera Manager packages:
[srinivas@dbversity.com ~]# hadoop version
Hadoop 2.0.0-cdh4.5.0
Subversion file:///var/lib/jenkins/workspace/CDH4.5.0-Packaging-Hadoop/build/cdh4/hadoop/2.0.0-cdh4.5.0/source/hadoop-common-project/hadoop-common -r 30821ec616ee7a21ee8447949b7c6208a8f1e7d8
Compiled by…