IJRTS Publications

SCHEDULING IN HADOOP

Poonam Madaan

Page No. : 19-22

ABSTRACT

Hadoop is based on distributed computing having HDFS file system (Hadoop Distributed File System). Hadoop is highly fault-tolerant and can be deployed on low cost hardware. Hadoop is very much suitable for high volume of data and it also provide the high speed access to the data of the application which we want to use. hadoop architecture is cluster based, which is consist of nodes(data note, name node), physically separate to each other, in ideal condition. The performance of hadoop can be increased by proper assignment of the tasks in the default scheduler. In hadoop a program known as map-reduce is used to collect data according to query. As hadoop is used for huge amount of data therefore scheduling in hadoop must be efficient for better performance. The research objective is to study and analyse various scheduling techniques, which are used to increase performance in hadoop.

FULL TEXT

PDF

Multidisciplinary Coverage

Agriculture
Applied Science
Biotechnology
Commerce & Management
Engineering
Human Social Science
Language & Literature
Mathematics & Statistics
Medical Research
Sanskrit & Vedic Sciences

Archives

SCHEDULING IN HADOOP

ABSTRACT

FULL TEXT

Multidisciplinary Coverage