str.splitlines-Python合辑3-字符串用法深度总结 - 码小课

当前位置:　首页>> 技术小册>> Python合辑3-字符串用法深度总结

str.splitlines(keepends=False)

有时想处理一个在边界处具有不同换行符（’\n’、\n\n’、’\r’、’\r\n’）的语料库。要拆分成句子，而不是单个单词。可以使用 splitline 方法来执行此操作。当 keepends=True 时，文本中包含换行符；否则它们被排除在外

import nltk  # You may have to `pip install nltk` to use this library.
macbeth = nltk.corpus.gutenberg.raw('shakespeare-macbeth.txt')
print(macbeth.splitlines(keepends=True)[:5])

Output:

['[The Tragedie of Macbeth by William Shakespeare 1603]\n', '\n', '\n', 'Actus Primus. Scoena Prima.\n', '\n']

该分类下的相关小册推荐：

Python数据分析与挖掘实战(上)

Python数据分析与挖掘实战(上)

Python自动化办公实战

机器学习算法原理与实战

Python合辑6-字典专题

Python合辑1-Python语言基础

Python合辑1-Python语言基础

Python面试指南

Python数据分析与挖掘实战(下)

Python数据分析与挖掘实战(下)

剑指Python(万变不离其宗)

剑指Python(万变不离其宗)

Python与办公-玩转PDF

Python与办公-玩转PDF

Python爬虫入门与实战开发(下)

Python爬虫入门与实战开发(下)

Python合辑14-面向对象编程案例(下)

Python合辑14-面向对象编程案例(下)

Python3网络爬虫开发实战(上)

Python3网络爬虫开发实战(上)