Extractive Document Summarization

Table of Contents

Installation
Usage

Installation

This project uses the numpy, jieba, and requests libraries. Make sure they are installed:

$ pip install numpy
$ pip install jieba
$ pip install requests
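
As a quick sanity check after installation, a small script along these lines can report which dependencies are still missing. This is a sketch, not part of the repository: the helper name `missing_packages` is hypothetical, and it assumes the HTTP library is the one imported as `requests`.

```python
import importlib.util

# Packages the README lists as dependencies (import names, assumed).
REQUIRED = ["numpy", "jieba", "requests"]


def missing_packages(names):
    """Return the subset of top-level package names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All dependencies are installed.")
```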

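Usage

First run crawler.py to crawl the URLs of Jianshu blog posts; they are saved to url.txt in the project root. Then run bs.py to crawl the article content, which is saved to /data. Finally, run gen_sentences.py, tf_and_idf.py, tf-idf.py, and summarization.py in that order. The generated summaries are also saved to /data.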
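
The script names suggest that sentences are scored with TF-IDF and the highest-scoring ones are kept as the summary. The following is a minimal, self-contained sketch of that general idea, not the repository's actual code. The function names, the choice to treat each sentence as one "document" for the IDF term, and the `top_k` parameter are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Split on Chinese and Western sentence-ending punctuation (a crude rule)."""
    parts = re.split(r"[。！？!?.]+", text)
    return [p.strip() for p in parts if p.strip()]


def summarize(text, top_k=2, tokenize=str.split):
    """Return the top_k sentences by summed TF-IDF score, in original order.

    Assumption: each sentence is treated as one document when computing IDF.
    """
    sentences = split_sentences(text)
    tokenized = [tokenize(s) for s in sentences]
    n = len(sentences)

    # Document frequency: the number of sentences in which each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))

    scores = []
    for tokens in tokenized:
        if not tokens:
            scores.append(0.0)
            continue
        tf = Counter(tokens)
        # Sum of (relative term frequency) * (inverse document frequency).
        score = sum(
            (count / len(tokens)) * math.log(n / df[term])
            for term, count in tf.items()
        )
        scores.append(score)

    # Take the highest-scoring sentences, then restore their original order.
    best = sorted(range(n), key=lambda i: scores[i], reverse=True)[:top_k]
    return [sentences[i] for i in sorted(best)]
```

The default tokenizer splits on whitespace, which only suits space-delimited text. For Chinese articles, a word segmenter such as `jieba.lcut` can be passed as `tokenize`, for example `summarize(article_text, top_k=3, tokenize=jieba.lcut)`.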

Repository: Sidney233/Text_Summarization

Files

.idea
README.md
bs.py
crawler.py
gen_sentences.py
key_word.py
summarization.py
tf-idf.py
tf_and_idf.py
