GithubHelp home page GithubHelp logo

child-search-expansions's Introduction

component-id type name description work-package pilot project resource release-date release-number release-link doi changelog licence copyright contributors related-components credits
child-search-expansion
WebApplication
Classification and curation of Listening Experiences with LLMs (Demo)
This demo component was developed with the aim of supporting the identification of implicit themes (classification) and metadata (curation) in text. It takes as reference the documentary evidence benchmark
WP4
CHILD
polifonia-project
05/09/2023
v1.0
Apache-2.0
Copyright (c) 2023 CHILD @ The Open University
Jason Carvalho <https://github.com/JaseMK>
Alba Morales Tirado <https://github.com/albamoralest>
informed-by
documentary-evidence-benchmark

Classification and curation of Listening Experiences (Demo)

DOI

This small study, undertaken as part of the wider CHILD pilot, focuses on harnessing LLM technology to classify existing text extracts within LED, a task traditionally performed by human domain experts, to address the challenges posed by the volume of textual data in fields such as music history. Our experiment evaluates the effectiveness of an LLM in categorizing text extracts under the specific theme of childhood, comparing its performance with that of a human domain expert. The comparison aims to quantify the alignment between machine and human interpretations in textual analysis, look at areas where LLM technology may show weaknesses and also investigate if there areas where LLMs are able to shed new light on data that may go unnoticed by humans.


The software included here was developed with the aim of supporting the identification of implicit themes in text and takes as reference the documentary evidence benchmark.

Interactions with the ChatGPT API (or other LLM) is currently handled in the chatgpt.py file. Interactions with the LED knowledge graph are handled in led.py. In order to run any of the scripts in this distribution, a copy of config.py.dist must be made, called config.py, in which a valid OpenAI API key should be specified.

A summary of the experiements performed is provided in 'output/CHILD_text_classification_with_LLM.pdf'

Results and analysis are provided in 'output/ChatGPT-CHILD-Analysis.xlsx'

child-search-expansions's People

Contributors

jasemk avatar albamoralest avatar enridaga avatar

Watchers

 avatar  avatar  avatar

child-search-expansions's Issues

Make expanded set of keywords user-selectable

Once the LLM has expanded the list of keywords, the user should be able to de-select certain terms if they deem them irrelevant and recreate the SPARQL query from the new set of words.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.