[雪峰磁针石 Blog] Big Data Hadoop Tools Python Tutorial 2: Accessing HDFS from Python
https://pypi.org/project/hdfs3 is no longer maintained.
PyArrow
https://pypi.org/project/hdfs/
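https://pypi.org/project/snakebite/ works well on Python 2 but has poor Python 3 support.
hdfs and PyArrow are the more commonly used options; this tutorial uses hdfs as the example:
Quick start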
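from hdfs import InsecureClient

# Connect to the NameNode's WebHDFS endpoint (50070 is the Hadoop 2.x default port; Hadoop 3.x uses 9870).
client = InsecureClient('http://localhost:50070', user='hduser_')
# List the entries in the HDFS root directory.
fs_folders_list = client.list("/")
print(fs_folders_list)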
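The hdfs package talks to the cluster over the WebHDFS REST API, so a listing call like the one above becomes an HTTP request against the NameNode. The following is a minimal sketch of what such a request URL looks like, using only the standard library. The helper name `webhdfs_url` is hypothetical and is not part of the hdfs package; the host, port, and user are simply the values from the example above, and the exact request the library sends may differ in detail.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(base_url, path, op, user=None):
    """Build a WebHDFS REST URL (hypothetical helper, for illustration only)."""
    # WebHDFS exposes the filesystem under the /webhdfs/v1 prefix.
    params = {"op": op}
    if user is not None:
        # Without Kerberos, the acting user is passed as the user.name parameter.
        params["user.name"] = user
    return f"{base_url.rstrip('/')}/webhdfs/v1{quote(path)}?{urlencode(params)}"


# URL for listing the root directory as user hduser_.
url = webhdfs_url("http://localhost:50070", "/", "LISTSTATUS", user="hduser_")
print(url)
# → http://localhost:50070/webhdfs/v1/?op=LISTSTATUS&user.name=hduser_
```

Opening a URL of this form in a browser or with curl can be a quick way to check that WebHDFS is reachable before debugging the Python client.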
with client.read('/user/hduser/input.txt', enco