dice-group / eslm

ESLM: An approach to improving entity summarization by leveraging language models and enriching them with knowledge graph embeddings

License: GNU General Public License v3.0

ESLM: Improving entity summarization by leveraging language models

ESLM is a model for entity summarization. At its core, it uses a language model augmented with an attention mechanism, which lets the model weight the most relevant parts of each input sequence when making predictions. A Multi-Layer Perceptron (MLP) then scores each triple, and these scores are used to select the most relevant triples for each entity. To improve effectiveness further, we also enrich the model with knowledge graph embeddings (KGE), which contribute additional semantic information.
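The pipeline described above (attention over a triple's token representations, an MLP that scores the triple, and selection of the best-scoring triples) can be illustrated with a small, dependency-free sketch. This is not the repository's implementation: the function names, the single-query attention pooling, the one-hidden-layer MLP, and the concatenation used for the KGE vectors are all assumptions chosen to keep the example short, and in practice the token vectors would come from BERT, ERNIE, or T5 rather than being supplied by hand.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def attention_pool(token_vecs, query):
    """Weight each token vector by softmax(query . token) and sum them."""
    weights = softmax([dot(query, t) for t in token_vecs])
    dim = len(token_vecs[0])
    return [sum(w * t[i] for w, t in zip(weights, token_vecs)) for i in range(dim)]


def mlp_score(features, w1, b1, w2, b2):
    """One hidden ReLU layer followed by a scalar output."""
    hidden = [max(0.0, dot(row, features) + b) for row, b in zip(w1, b1)]
    return dot(w2, hidden) + b2


def top_k_triples(triples, token_encodings, params, k, kge_vecs=None):
    """Score every triple and return the k highest-scoring ones.

    params is (query, w1, b1, w2, b2). If kge_vecs is given, each KGE
    vector is concatenated to the pooled representation (an assumption
    about how enrichment could be wired in).
    """
    query, w1, b1, w2, b2 = params
    scores = []
    for i, tokens in enumerate(token_encodings):
        features = attention_pool(tokens, query)
        if kge_vecs is not None:
            features = features + list(kge_vecs[i])
        scores.append(mlp_score(features, w1, b1, w2, b2))
    order = sorted(range(len(triples)), key=lambda i: scores[i], reverse=True)
    return [triples[i] for i in order[:k]]
```

Calling `top_k_triples` with a list of triples, one list of token vectors per triple, and a parameter tuple returns the `k` triples to keep as the summary; in a trained model the parameters would be learned rather than set manually.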

Dataset

In this experiment, we used two datasets:

  1. ESBM (version 1.2), which includes 175 entities: 150 from DBpedia and 25 from LinkedMDB.
  2. FACES, which consists of 50 entities.

Environment and Dependencies

Environment

  • Ubuntu 10.04.2 LTS
  • Python 3.6+
  • PyTorch 1.7.0

Installation

git clone https://github.com/u2018/ESLM.git

ESLM Framework

The ESLM framework currently supports three language models for entity summarization:

  1. BERT (Bidirectional Encoder Representations from Transformers)
  2. ERNIE (Enhanced Representation through kNowledge Integration)
  3. T5 (Text-to-Text Transfer Transformer)

To select a model, pass its lowercase name to the --model option on the command line:

  1. For BERT, use bert.
  2. For ERNIE, use ernie.
  3. For T5, use t5.

Usage information

main.py [-h] [--train] [--no-train] [--test] [--no-test] [--enrichment] [--no-enrichment] [--model MODEL] [--max_length MAX_LENGTH] [--epochs EPOCHS] [--learning_rate LEARNING_RATE]
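A parser along the following lines would accept the options in the usage line above. It is a minimal sketch, not the repository's actual main.py. The paired store_true/store_false actions, the None defaults for --model, --max_length, --epochs and --learning_rate, and the restriction of --model to the three names listed earlier are assumptions. The on-by-default behaviour of training, testing and enrichment is inferred from the example commands in this README, where a stage stays enabled unless its --no-... flag is given.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented usage line (defaults are assumed)."""
    parser = argparse.ArgumentParser(prog="main.py", description="ESLM")

    # Each on/off pair writes to one shared destination.
    for name in ("train", "test", "enrichment"):
        parser.add_argument(f"--{name}", dest=name, action="store_true")
        parser.add_argument(f"--no-{name}", dest=name, action="store_false")
    # Assumption: every stage is enabled unless explicitly switched off.
    parser.set_defaults(train=True, test=True, enrichment=True)

    # Assumption: only the three documented model names are accepted.
    parser.add_argument("--model", metavar="MODEL", default=None,
                        choices=["bert", "ernie", "t5"])
    # Placeholder defaults; the real values are not stated in this README.
    parser.add_argument("--max_length", type=int, default=None)
    parser.add_argument("--epochs", type=int, default=None)
    parser.add_argument("--learning_rate", type=float, default=None)
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```

With this sketch, `build_parser().parse_args(["--model", "t5", "--no-train"])` yields a namespace in which `train` is `False` while `test` and `enrichment` remain `True`.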

ESLM

Training and evaluating the model using the standard settings

python main.py --model t5 --no-enrichment

Evaluating only

python main.py --model t5 --no-enrichment --no-train

ESLM Enrichment with KGE

Training and evaluating the model using the standard settings

python main.py --model t5

Evaluating only

python main.py --model t5 --no-train
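When several configurations need to be run in sequence, the commands above can be assembled programmatically. The helper below is a hypothetical convenience and not part of the repository; it uses only the flags shown in this README and assumes it is invoked from the directory containing main.py.

```python
MODELS = ("bert", "ernie", "t5")


def build_command(model, enrichment=True, train=True):
    """Return the argument list for one ESLM run.

    Hypothetical helper: it only reproduces the command lines
    documented above and does not execute anything itself.
    """
    if model not in MODELS:
        raise ValueError(f"unknown model: {model!r}")
    cmd = ["python", "main.py", "--model", model]
    if not enrichment:
        cmd.append("--no-enrichment")
    if not train:
        cmd.append("--no-train")
    return cmd
```

Each returned list can be passed to `subprocess.run(cmd, check=True)`, for example inside a loop over `MODELS` to evaluate all three models with and without enrichment.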

How to Cite

@inproceedings{ESLM2024,
  author = {Firmansyah, Asep Fajar and Moussallem, Diego and Ngonga Ngomo, Axel-Cyrille},
  booktitle = {The Semantic Web},
  title = {ESLM: Improving Entity Summarization by Leveraging Language Models},
  url = {https://doi.org/10.1007/978-3-031-60626-7_9},
  year = 2024
}

Contact

If you have any questions or feedback, feel free to contact us at [email protected]
