In this post I will show you how to check Hadoop" version using CLI. When we create the application which will be run on the cluster we firstly must know what Hadoop" version is used on our cluster to be compatible.
Table of Contents
Introduction
Hadoop
Apache Hadoop" is an open-source software framework for distributed storage and distributed processing of very large data sets on computer clusters. Hadoop" consists of two main components: the Hadoop" Distributed File System (HDFS") and the MapReduce programming" model.
HDFS" is a distributed file system that allows users to store large amounts of data across multiple machines in a Hadoop" cluster. It is designed to be fault-tolerant, meaning that it can continue to operate even if one or more of the machines in the cluster fail.
MapReduce is a programming model that is designed to process large amounts of data in parallel across a Hadoop" cluster. It is based on the idea" of breaking a large data processing task into smaller chunks (map tasks) that can be distributed across the cluster and then combining the results of those tasks (reduce tasks).
Hadoop" is often used for storing and processing large data sets for tasks such as search indexing, data mining, machine learning", and log processing. It is a popular choice for companies that need to handle large amounts of data and are looking for a cost-effective and scalable solution.
Hadoop Releases
Here is a list of the major releases of Apache Hadoop", along with their release dates:
- Hadoop" 1.0.0: December 2011
- Hadoop" 1.0.1: February 2012
- Hadoop" 1.1.0: May 2012
- Hadoop" 1.2.0: October 2012
- Hadoop" 2.0.0: October 2013
- Hadoop" 2.2.0: August 2014
- Hadoop" 2.4.0: August 2015
- Hadoop" 2.6.0: November 2015
- Hadoop" 2.7.0: April 2016
- Hadoop" 2.8.0: December 2016
- Hadoop" 2.9.0: October 2017
- Hadoop" 3".0.0: December 2017
- Hadoop" 3".1.0: April 2018
- Hadoop" 3".2.0: November 2018
- Hadoop" 3".3.0: May 2019
- Hadoop" 3".4.0: December 2019
Note that these are the major releases of Hadoop", and that there have been numerous minor releases and point releases between each major release. You can find more information about the history of Hadoop" and the various releases on the Apache Hadoop" website.
Command Line Interface (CLI)
The command line interface" (CLI) is a means of interacting with a computer program by typing commands to it and receiving output in the form of text. It is a text-based interface that allows users to enter commands using a keyboard, rather than using a graphical user" interface (GUI) with a mouse and icons.
CLIs are often used in operating systems and servers to perform tasks such as managing files, installing software, and configuring system settings. They are also commonly used in programming" languages and database" systems to execute commands and scripts.
To use a CLI, a user" typically opens a terminal window or a command prompt and types in a command. The command is then passed to the program, which executes the command and returns the output. Some CLIs allow users to enter multiple commands on a single line, separated by semicolons or other delimiters.
CLIs can be more efficient than GUI-based interfaces for tasks that are performed frequently or that require a high level of precision. They are also often preferred by experienced users who are comfortable with typing commands and who do not need the visual cues provided by a GUI.
How To Check Hadoop Version?
To check the Hadoop version you can use command line interface" (CLI).
$ hadoop version ========================================================= Hadoop 2.6.0-cdh5.16.2
In my case the version is: Hadoop" 2.6.0-cdh5.16.2.
You can also use the hadoop version -full
command to display more detailed information about the installed version of Hadoop", including the build date and the checksum of the source code.
Keep in mind that the hadoop version
command may not work if the hadoop
executable is not in your system’s PATH
environment variable. In this case, you will need to specify the full path to the hadoop
executable in order to run the command.
Summary
Now you know how to check Hadoop version and use this information to provide correct dependency when you’re creating the applications which will be running on the cluster.
Could You Please Share This Post?
I appreciate It And Thank YOU! :)
Have A Nice Day!