微型机与应用Issue(13):42-44,48,4.
基于Hadoop的小文件量化方法研究
Research on the approach of small file cut-off points based on Hadoop
摘要
Abstract
To solve the problem of the small file which could not be handled efficiently by the present Hadoop platform. A method based on least squares curve fitting to ensure “how small is small” is proposed. First and foremost, a criteria for quantifying the access time of the small file is defined. What′s more, the small file access time is used to act as the impact factors of the problem to determine what is a small file. Finally, the means based on the relevant knowledge of linear fitting is found by the experiment of the access time of the different data sets.关键词
Hadoop/小文件问题/曲线拟合的最小二乘法/线性拟合Key words
Hadoop/the small file problem/least squares curve fitting/linear fitting分类
信息技术与安全科学引用本文复制引用
谭跃生,赵玉龙,王静宇..基于Hadoop的小文件量化方法研究[J].微型机与应用,2014,(13):42-44,48,4.基金项目
国家自然科学基金资助项目(61163025);内蒙古自然科学基金资助项目 ()