Hadoop is based on distributed computing having HDFS file system (Hadoop Distributed File System). Hadoop is highly fault-tolerant and can be deployed on low cost hardware. Hadoop is very much suitable for high volume of data and it also provide the high speed access to the data of the application which we want to use. hadoop architecture is cluster based, which is consist of nodes(data note, name node), physically separate to each other, in ideal condition. The performance of hadoop can be increased by proper assignment of the tasks in the default scheduler. In hadoop a program known as map-reduce is used to collect data according to query. As hadoop is used for huge amount of data therefore scheduling in hadoop must be efficient for better performance. The research objective is to study and analyse various scheduling techniques, which are used to increase performance in hadoop.
Copyright © 2025 IJRTS Publications. All Rights Reserved | Developed By iNet Business Hub