Gang Of Coders
Home
About Us
Contact Us
All Hadoop Solutions on Gang of Coders
Total of 85 Hadoop Solutions
How does Hive compare to HBase?
Hadoop
Hbase
Hive
Hadoop on OSX "Unable to load realm info from SCDynamicStore"
Macos
Hadoop
Osx Lion
Difference between hadoop fs -put and hadoop fs -copyFromLocal
Hadoop
Hdfs
how to kill hadoop jobs
Hadoop
Kill
Jobs
Ports are not available: listen tcp 0.0.0.0/50070: bind: An attempt was made to access a socket in a way forbidden by its access permissions
Docker
Hadoop
Port
Docker Image
Scalable Image Storage
Storage
Couchdb
Hadoop
Hbase
Hdfs
PIG how to count a number of rows in alias
Hadoop
Apache Pig
Hbase quickly count number of rows
Hadoop
Hbase
Bigdata
Where does hadoop mapreduce framework send my System.out.print() statements ? (stdout)
Hadoop
Mapreduce
Does Hive have a String split function?
Hadoop
Hive
Namenode not getting started
Hadoop
Hdfs
Just get column names from hive table
Sql
Hadoop
Hive
java.lang.RuntimeException: Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient
Apache
Hadoop
Hive
How to stop/kill Airflow tasks from the UI
Python
Hadoop
Airflow
How to overwrite the existing files using hadoop fs -copyToLocal command
Hadoop
Hadoop "Unable to load native-hadoop library for your platform" warning
Java
Linux
Hadoop
Hadoop2
java.library.path
How to restart a failed task on Airflow
Python
Hadoop
Airflow
Java vs Python on Hadoop
Java
Python
Hadoop
How to load data to hive from HDFS without removing the source file?
Hadoop
Hive
Permission denied at hdfs
Shell
Security
Hadoop
Permissions
Hdfs
Where are logs in Spark on YARN?
Hadoop
Logging
Apache Spark
Cloudera
Hadoop Yarn
What is the relation between 'mapreduce.map.memory.mb' and 'mapred.map.child.java.opts' in Apache Hadoop YARN?
Apache
Hadoop
Configuration
Hadoop Yarn
Heap Size
Hadoop: «ERROR : JAVA_HOME is not set»
Linux
Hadoop
Ubuntu 11.04
Hive load CSV with commas in quoted fields
Hadoop
Hbase
Hive
Hdfs
Delimiter
data block size in HDFS, why 64MB?
Database
Hadoop
Mapreduce
Block
Hdfs
How to Access Hive via Python?
Python
Hadoop
Hive
How to export data from Spark SQL to CSV
Hadoop
Apache Spark
Export to-Csv
Hiveql
Apache Spark-Sql
Difference between Pig and Hive? Why have both?
Hadoop
Hive
Apache Pig
Apache Spark: The number of cores vs. the number of executors
Hadoop
Apache Spark
Hadoop Yarn
When to use Hadoop, HBase, Hive and Pig?
Hadoop
Hbase
Hive
Apache Pig
What are the pros and cons of parquet format compared to other formats?
File
Hadoop
Hdfs
Avro
Parquet
How to turn off INFO logging in Spark?
Python
Scala
Apache Spark
Hadoop
Pyspark
Spark - load CSV file as DataFrame?
Scala
Apache Spark
Hadoop
Apache Spark-Sql
Hdfs
How to copy file from HDFS to the local file system
Hadoop
Copy
Hdfs
What is the difference between partitioning and bucketing a table in Hive ?
Hadoop
Hive
Difference between HBase and Hadoop/HDFS
Hadoop
Nosql
Hbase
Hdfs
Difference
What is the purpose of shuffling and sorting phase in the reducer in Map Reduce Programming?
Sorting
Hadoop
Mapreduce
Hdfs
Shuffle
Name node is in safe mode. Not able to leave
Hadoop
Hdfs
Chaining multiple MapReduce jobs in Hadoop
Hadoop
Mapreduce
Failed to locate the winutils binary in the hadoop binary path
Hadoop
How does Hadoop process records split across block boundaries?
Hadoop
Split
Mapreduce
Block
Hdfs
what's the difference between "hadoop fs" shell commands and "hdfs dfs" shell commands?
Hadoop
Hdfs
Difference between Hive internal tables and external tables?
Hadoop
Hive
Hiveql
connect to host localhost port 22: Connection refused
Linux
Hadoop
Ssh
How does the MapReduce sort algorithm work?
Algorithm
Sorting
Parallel Processing
Hadoop
Mapreduce
The way to check a HDFS directory's size?
Hadoop
Command Line
Directory
Hdfs
Avro vs. Parquet
Hadoop
Avro
Parquet
Can apache spark run without hadoop?
Hadoop
Amazon S3
Apache Spark
Mapreduce
Mesos
hadoop No FileSystem for scheme: file
Java
Hadoop
Io
What is the difference between spark.sql.shuffle.partitions and spark.default.parallelism?
Performance
Apache Spark
Hadoop
Apache Spark-Sql
Is there a .NET equivalent to Apache Hadoop?
C#
.Net
Hadoop
Mapreduce
How to know Hive and Hadoop versions from command prompt?
Hadoop
Hive
Parquet vs ORC vs ORC with Snappy
Hadoop
Hive
Parquet
Snappy
Orc
Container is running beyond memory limits
Hadoop
Mapreduce
Hadoop Yarn
Mrv2
Large scale data processing Hbase vs Cassandra
Nosql
Hadoop
Cassandra
Hbase
Data Processing
How do I output the results of a HiveQL query to CSV?
Database
Hadoop
Hive
Hiveql
When do reduce tasks start in Hadoop?
Hadoop
Mapreduce
Reduce
How to check if ZooKeeper is running or up from command prompt?
Hadoop
Config
Apache Zookeeper
Apache Kafka
Ps
hadoop copy a local file system folder to HDFS
Hadoop
Hdfs
Hadoop truncated/inconsistent counter name
Java
Hadoop
Mapreduce
Hadoop Yarn
Where does Hive store files in HDFS?
Hadoop
Hive
Hdfs
merge output files after reduce phase
Hadoop
Mapreduce
Buiding Hadoop with Eclipse / Maven - Missing artifact jdk.tools:jdk.tools:jar:1.6
Java
Maven
Maven 2
Hadoop
Cloudera
How to Delete a directory from Hadoop cluster which is having comma(,) in its name?
File
Hadoop
Comma
How to delete and update a record in Hive
Hadoop
Hive
Sql Delete
Is there any way to get the column name along with the output while execute any query in Hive?
Hadoop
Hive
Rdbms
What is Hive: Return Code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
Hadoop
Mapreduce
Hive
Hive: how to show all partitions of a table?
Hadoop
Hive
Differences between Amazon S3 and S3n in Hadoop
Hadoop
Amazon S3
Hdfs
Integration testing Hive jobs
Java
Testing
Hadoop
Mapreduce
Hive
HDFS error: could only be replicated to 0 nodes, instead of 1
Amazon Ec2
Hadoop
Hive insert query like SQL
Sql
Hadoop
Hive
Hiveql
Write to multiple outputs by key Spark - one Spark job
Scala
Hadoop
Output
Hdfs
Apache Spark
Why is there no 'hadoop fs -head' shell command?
Hadoop
Hdfs
Hive cluster by vs order by vs sort by
Hadoop
Hql
Hive
HDFS free space available command
Hadoop
Hdfs
How to fix corrupt HDFS FIles
Hadoop
Hdfs
How to check Spark Version
Apache Spark
Hadoop
Cloudera
out of Memory Error in Hadoop
Java
Hadoop
Hadoop cluster setup - java.net.ConnectException: Connection refused
Java
Hadoop
Configuration
Connectexception
Life without JOINs... understanding, and common practices
Orm
Nosql
Hadoop
Join
Bigtable
Stop Java Coffee Cup icon from appearing in the Dock on Mac OSX
Java
Macos
Hadoop
Dock
How to access s3a:// files from Apache Spark?
Hadoop
Apache Spark
Amazon S3
Write a file in hdfs with Java
Java
Hadoop
Hdfs
How does impala provide faster query response compared to hive
Hadoop
Hive
Impala