myhhub / knowledgegraph

Knowledge graph: building a knowledge graph from scratch

Shell 0.85% Python 98.69% Batchfile 0.46%
knowledge-graph knowledge-management named-entity-recognition question-answering relation-extraction

knowledgegraph's Introduction

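Knowledge graph: building a knowledge graph from scratch, covering fundamentals, construction theory, and hands-on construction, from theory to implementation.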

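I. Fundamentals

  1. Knowledge Graph Basics, Part 1: Basic Concepts of Knowledge Graphs
  2. Knowledge Graph Basics, Part 2: Knowledge Representation and Knowledge Modeling
  3. Knowledge Graph Basics, Part 3: Knowledge Extraction
  4. Knowledge Graph Basics, Part 4: Knowledge Mining
  5. Knowledge Graph Basics, Part 5: Knowledge Storage
  6. Knowledge Graph Basics, Part 6: Knowledge Fusion
  7. Knowledge Graph Basics, Part 7: Knowledge Reasoning
  8. Knowledge Graph Basics, Part 8: Semantic Search
  9. Knowledge Graph Basics, Part 9: Knowledge-based Question Answering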

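II. Papers (Construction Theory)

Two papers are mainly recommended:

  1. "An Accurate and Efficient Method for Constructing Domain Knowledge Graphs" (一种准确而高效的领域知识图谱构建方法) by Yang Yuji of Tsinghua University. It describes how to build a domain knowledge graph semi-automatically in four steps. The steps are clearly laid out and the paper is highly valuable as a reference.

  2. "Research on Chinese Knowledge Graph Construction Methods Based on Multiple Data Sources" (基于多种数据源的中文知识图谱构建方法研究), the doctoral dissertation of Hu Fanghuai of East China University of Science and Technology. It explains how to build general-purpose and industry knowledge graphs from multiple data sources and covers a number of construction techniques in some detail, so it has some reference value.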

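III. Blog Posts (Hands-on Construction)

The "Learning Knowledge Graphs from Scratch" series walks through hands-on coding, step by step, to build a movie-domain knowledge graph and an encyclopedia knowledge graph.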

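  1. Learning Knowledge Graphs from Scratch (1): Movie Knowledge Graph Construction, 1. Acquiring Semi-structured Data
  2. Learning Knowledge Graphs from Scratch (2): Movie Knowledge Graph Construction, 2. From Structured Data to RDF, and Interaction via Apache Jena
  3. Learning Knowledge Graphs from Scratch (3): Movie Knowledge Graph Construction, 3. Simple Knowledge-based Question Answering with REfO
  4. Learning Knowledge Graphs from Scratch (4): Movie Knowledge Graph Construction, 4. Simple Semantic Search with ElasticSearch
  5. Learning Knowledge Graphs from Scratch (5): Movie Knowledge Graph Construction, 5. Relation Extraction from Unstructured Text with DeepDive
  6. Learning Knowledge Graphs from Scratch (6): Movie Knowledge Graph Construction, 6. Storing Relational Data in the Neo4j Graph Database
  7. Learning Knowledge Graphs from Scratch (7): Encyclopedia Knowledge Graph Construction, 1. Encyclopedia Knowledge Extraction
  8. Learning Knowledge Graphs from Scratch (8): Encyclopedia Knowledge Graph Construction, 2. Data Cleaning and Storage in the Neo4j Graph Database
  9. Learning Knowledge Graphs from Scratch (9): Encyclopedia Knowledge Graph Construction, 3. Building a Dataset for TensorFlow Neural-network Relation Extraction (using OpenNRE)
  10. Learning Knowledge Graphs from Scratch (10): Encyclopedia Knowledge Graph Construction, 4. From Structured Data to RDF
  11. Learning Knowledge Graphs from Scratch (11): Encyclopedia Knowledge Graph Construction, 5. Using Jena and SPARQL Queries
  12. Learning Knowledge Graphs from Scratch (12): Encyclopedia Knowledge Graph Construction, 6. Knowledge Fusion with Silk
  13. Learning Knowledge Graphs from Scratch (13): Encyclopedia Knowledge Graph Construction, 7. Batch Knowledge Fusion with Silk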

knowledgegraph's Issues

In "Baidu Baike knowledge graph construction 3", the generated train.json and test.json are empty

I found the following in the build_entity_relation() function of gen_re_from_baidu.py:
for line_num in tqdm(range(total_lines)):
    re_in_lemma = 0
    if count_na + count_re > args.max_sentence:
        # re_in_lemma = 0
        continue
    all_info = inf.readline().strip()
    title_disambi_text = all_info.split(",")
    if len(title_disambi_text) != 3:
        error_counts += 1
        continue
The statements that follow inside this for loop are never executed, because total_lines is only 1.

This is the output of the run:
100%|██████████| 38429/38429 [00:00<00:00, 205980.60it/s]
error_counts: 0
100%|██████████| 1/1 [00:00<00:00, 32.86it/s]
count_re: 0 count_na: 0 count_total: 0
total_sentence_used: 0
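
One way to check the report that total_lines is only 1 is to inspect the input file directly. The following is a minimal sketch, not code from the repository: the function name summarize_input and its parameters are hypothetical, and it assumes the input is a UTF-8 text file with one comma-separated record per line, mirroring the split(",") and len(...) != 3 check in the loop quoted in this issue.

```python
from collections import Counter


def summarize_input(path, sep=",", expected_fields=3, encoding="utf-8"):
    """Count the lines in `path` and tally how many fields each line has.

    Hypothetical helper for diagnosis only; it is not part of
    gen_re_from_baidu.py.
    Returns (total_lines, usable_lines, field_count_histogram).
    """
    total_lines = 0
    field_counts = Counter()
    with open(path, encoding=encoding) as inf:
        for line in inf:
            total_lines += 1
            # Same split as in the quoted loop: strip, then split on commas.
            field_counts[len(line.strip().split(sep))] += 1
    # Lines with exactly `expected_fields` fields would pass the != 3 check.
    usable_lines = field_counts.get(expected_fields, 0)
    return total_lines, usable_lines, dict(field_counts)
```

If total_lines comes back as 1 for a file that should hold many records, that may suggest the records were written without line breaks or that the path points at a different file than intended. If total_lines is large but usable_lines is 0, the separator or field count assumed by the loop probably does not match the data.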
