hadoop
Projects with this topic
-
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, ...
Updated -
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
Updated -
Helper image that extends Java build tools container with additional prereqs needed to compile Hadoop from source for ARM64. This includes backported patches for things like protobuf.
Updated -
Graduate Tools and Models - Data Science.
Rice University, Spring 2024.
Updated -
-
Strategic Information Management - Hadoop with Spark
Updated -
This web app finds the best configuration of a Spark Application given the hardware of the cluster
Updated -
Workshop de Big Data a cargo de Jimmy Farfán docente del curso online "Desarrollo de Aplicaciones de Big Data en Hadoop". Si requieren más información o cualquier duda pueden ubicarnos en facebook como Data Hack Formation.
Updated -
-
-
-
This project was an exercise for the Master in Big Data Engineering and Data Science at "Universidad Autónoma de Madrid". See the readme.md for more information.
Updated -
This project performs SQL operations on a CSV input in HDFS, using Hadoop's Map-Reduce.
Updated -
-
A docker image for Halyard SDK (including Apache Hadoop and Apache HBase).
Updated -
A base docker image for Apache Hadoop and Apache HBase clients.
Updated -
A coherent introduction to the Hadoop environment and HDFS.
Updated -
This project aggregates trending data from Ukraine based Twitter accounts. The raw aggregated data is cleansed before analysis using some Big-data methods. The purpose of this project is to familiarize myself with the workings of Hadoop for HDFS and Map-Reduce infrastructure.
Updated -
Explaining usage of classroom spaces and trending subjects using classroom usage data.
Updated -
Map/Reduce application that analyzes movie ratings collected by Movielens, leveraging Hadoop MapReduce, Hadoop Distributed File System and Apache Flume.
Coursework in Structures and Architectures for Big Data 2016/2017.
UpdatedUpdated