yusufakcakaya / nlp-entity-recognition-with-knowledge-graph

Algorythm-NLP-Entity-Recognition 🗣 🤖 🎖

The Mission

This project aims to reveal entity relations in news text. It randomly selects documents from the NLTK Reuters corpus (from nltk.corpus import reuters), runs them through an information extraction pipeline, and transfers the resulting relationships to a Neo4j graph database.

Text Cleaning : Text cleaning is the process of preparing raw text for NLP (Natural Language Processing) so that machines can understand human language.
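As a rough illustration of what such a cleaning step can look like, here is a minimal sketch using only the Python standard library. The function name clean_text and the exact set of normalizations are assumptions made for this example, not the project's actual code.

```python
import html
import re
import unicodedata


def clean_text(raw: str) -> str:
    """Turn a raw news snippet into one normalized line of text."""
    text = html.unescape(raw)                   # "&amp;" -> "&", "&lt;" -> "<"
    text = unicodedata.normalize("NFKC", text)  # fold Unicode compatibility characters
    text = re.sub(r"\s+", " ", text)            # collapse newlines, tabs and runs of spaces
    return text.strip()


print(clean_text("  U.S. &amp;  Japan\n trade   talks "))
```

Real pipelines often add further steps (lowercasing, removing boilerplate lines), depending on what the later stages expect.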

Named-entity Recognition : Named-entity recognition (NER) is an information extraction task that seeks to locate named entities in text and classify them into pre-defined categories such as names of persons, organizations, locations, time expressions, quantities, monetary values, percentages, etc.
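To make "locate and classify" concrete, the sketch below uses a dictionary (gazetteer) lookup. This is a deliberately simple stand-in for the statistical NER model a real pipeline would use; the GAZETTEER contents and the function name find_entities are assumptions for illustration only.

```python
import re

# Hypothetical gazetteer: surface form -> entity label.
GAZETTEER = {
    "Reuters": "ORG",
    "Bank of Japan": "ORG",
    "Japan": "LOC",
    "Tokyo": "LOC",
}


def find_entities(text, gazetteer=GAZETTEER):
    """Return (surface form, label) pairs for gazetteer names found in text."""
    if not gazetteer:
        return []
    # Longer names come first so "Bank of Japan" wins over the nested "Japan".
    names = sorted(gazetteer, key=len, reverse=True)
    pattern = re.compile(r"\b(?:" + "|".join(re.escape(n) for n in names) + r")\b")
    return [(m.group(0), gazetteer[m.group(0)]) for m in pattern.finditer(text)]


print(find_entities("The Bank of Japan met in Tokyo."))
```

A lookup like this only finds names it already knows, which is why trained models are normally preferred for open-ended news text.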

Coreference Resolution : Coreference resolution is the task of finding all expressions that refer to the same entity in a text. It is an important step for NLP tasks that require language understanding, such as document summarization, question answering, and information extraction.
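The idea can be sketched with a naive "most recent antecedent" heuristic: each pronoun is replaced by the last entity name seen before it. This ignores gender, number and syntax, so it is only an illustration of the task and not how trained coreference models work. The pronoun list, the function name resolve_pronouns and the sample sentence are made up for this example.

```python
import re

PRONOUNS = {"he", "she", "it", "they"}  # tiny illustrative list, not exhaustive


def resolve_pronouns(text, entity_names):
    """Replace each listed pronoun with the most recently mentioned entity name."""
    names = sorted(entity_names, key=len, reverse=True)
    parts = [r"\b(?:" + "|".join(re.escape(n) for n in names) + r")\b"] if names else []
    # Tokens are entity names, words, or runs of non-word characters,
    # so joining them back together reproduces the original spacing.
    token_re = re.compile("|".join(parts + [r"\w+", r"\W+"]))
    resolved, last_entity = [], None
    for token in token_re.findall(text):
        if token in names:
            last_entity = token
        elif token.lower() in PRONOUNS and last_entity is not None:
            token = last_entity
        resolved.append(token)
    return "".join(resolved)


print(resolve_pronouns("Smith spoke on Monday. He said rates may rise.", ["Smith"]))
```

After this step, every sentence names its entities explicitly, which makes the later relationship extraction simpler.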

Entity Linking : Entity linking is the step in which we connect the entities that emerge from named-entity recognition, such as names of persons, organizations, locations, time expressions, quantities, monetary values, and percentages.
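One common way to realize this step is to map different surface forms of the same entity to a single canonical name, so that they end up as one node in the graph. The sketch below assumes a small hand-written alias table; ALIASES and link_entity are hypothetical names, and the project may link entities differently.

```python
# Hypothetical alias table: lowercased surface form -> canonical entity name.
ALIASES = {
    "u.s.": "United States",
    "usa": "United States",
    "united states": "United States",
}


def link_entity(mention, aliases=ALIASES):
    """Return the canonical name for a mention, or the tidied mention if unknown."""
    cleaned = " ".join(mention.split())  # trim and collapse inner whitespace
    return aliases.get(cleaned.lower(), cleaned)


print(link_entity("U.S."), "|", link_entity("Tokyo"))
```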

Relationship Extraction : Relationship extraction is the task of extracting semantic relationships from a text. Extracted relationships usually occur between two or more entities of a certain type (e.g. Person, Organisation, Location) and fall into a number of semantic categories (e.g. married to, employed by, lives in).
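A minimal pattern-based sketch of this task is shown below: it looks for "A trigger-phrase B" between known entities and emits (source, relation, target) triples. The trigger phrases follow the categories named above; the TRIGGERS table, the function name extract_relations and the sample sentence are assumptions for illustration, not the project's pipeline.

```python
# Hypothetical trigger phrases mapped to relation labels.
TRIGGERS = {
    "is employed by": "EMPLOYED_BY",
    "lives in": "LIVES_IN",
    "is married to": "MARRIED_TO",
}


def extract_relations(text, entities, triggers=TRIGGERS):
    """Return (source, relation, target) triples for 'A <trigger> B' patterns."""
    triples = []
    for phrase, label in triggers.items():
        for source in entities:
            for target in entities:
                if source != target and f"{source} {phrase} {target}" in text:
                    triples.append((source, label, target))
    return triples


text = "Anna Lee lives in Oslo. Anna Lee is employed by Norsk Hydro."
print(extract_relations(text, ["Anna Lee", "Oslo", "Norsk Hydro"]))
```

Exact string patterns miss most real phrasings; they are used here only because they make the shape of the output triples easy to see.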

Graph : The knowledge graph is created with Neo4j. Neo4j stores and manages data in its natural, connected state and maintains the relationships between data items. This enables fast queries, gives deeper context for analytics, and helps us to understand relations.
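Since the relationships are handed to Neo4j as a .csv file, the sketch below shows one way to serialize triples to CSV and a Cypher statement that could load such a file. The column names (source, relation, target), the file name relations.csv, and the Entity and RELATION labels are assumptions for this example; the layout of the project's own CSV files may differ.

```python
import csv
import io


def triples_to_csv(triples):
    """Serialize (source, relation, target) triples as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["source", "relation", "target"])
    writer.writerows(triples)
    return buffer.getvalue()


# Cypher that Neo4j could run once the file is in its import directory.
# MERGE avoids creating duplicate nodes and relationships on repeated loads.
LOAD_QUERY = """
LOAD CSV WITH HEADERS FROM 'file:///relations.csv' AS row
MERGE (s:Entity {name: row.source})
MERGE (t:Entity {name: row.target})
MERGE (s)-[:RELATION {type: row.relation}]->(t)
"""

print(triples_to_csv([("Anna Lee", "LIVES_IN", "Oslo")]))
```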

How to reach the data?

You can find out how to use the data by following this link: Reuters

  • This dataset contains the text of 10,788 news documents totaling 1.3 million words and is publicly available.
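A minimal sketch of randomly selecting documents from this corpus with NLTK is shown below. The helper names sample_file_ids and load_reuters_sample, the sample size and the seed are assumptions for illustration; the NLTK import sits inside the loader so that the sampling helper can be used on any list of ids.

```python
import random


def sample_file_ids(file_ids, k, seed=0):
    """Pick up to k distinct document ids, reproducibly for a given seed."""
    ids = list(file_ids)
    return random.Random(seed).sample(ids, min(k, len(ids)))


def load_reuters_sample(k=5, seed=0):
    """Download the corpus if needed and return {file_id: raw_text} for k random documents."""
    import nltk  # imported lazily; only this function needs NLTK installed

    nltk.download("reuters", quiet=True)
    from nltk.corpus import reuters

    return {fid: reuters.raw(fid) for fid in sample_file_ids(reuters.fileids(), k, seed)}
```

Calling load_reuters_sample(k=3) would return three raw news texts keyed by their corpus file ids.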

Installation

  • Pull requests are welcome.
  • To get the code, clone https://github.com/yusufakcakaya/Algorythm-NLP-Entity-Recognition.git

Repo Architecture

Algorythm-NLP-Entity-Recognition
│
│__ __ __ datasets                     : datasets for experimenting
│
│__ __ 01.Data_exploration.ipynb       : get Reuters data from nltk
│
│__ __ 02.I.E.Pipeline                 : get entity relations
│
│__ __ 03.B.K.G_using_NER.ipynb        : create .csv for neo4j
│
│__ __ 04.neo4j.ipynb                  : .py version of neo4j
│
│__ __ KeatingDataset.csv              : .csv file for neo4j
│
│__ __ README.md                       : explains the project
│
│__ __ data.csv                        : .csv file for KeatingDataset.csv
│
│__ __ requirements.txt
│

Visuals

Some visualisations:

It shows the relation between source and target.

schema

We can easily see the relations between entities.

nr

Collaborators

The design and construction phases of the project were carried out by 3 collaborators (Arfa Meher, Nichelle Pinto Machado, Yusuf Akcakaya).

Timeline

  • Type of Challenge: Learning
  • Duration: 2 weeks
  • Deadline: 20/01/2021 4:30 PM
  • Team challenge : group

Good Luck!
