GithubHelp home page GithubHelp logo

gensim-core-concepts's Introduction

Gensim Core Concepts - Exploration

As per here though with different sample data and a few tweaks.

Work Breakdown Structure:

High Priority:

  • Add computer science/programming controlled vocabularies. #DONE
  • Parse software engineering sample_corpus with gensim utilities. #DONE
  • Corpus Streaming โ€“ Build dictionary streaming one document at a time (data/build_corpus_dictionary.py) #DONE
  • Explore Corpus Formats #DONE
  • Topics and Transformations #FOCUS
  • Create bespoke MyCorpus class in .classes/. #TODO

Mid Priority:

  • Design ontology model for software engineering domain. #TODO
    • Include marker for single/few-letter domain words (i.e. "c", "R", etc.)
    • Identify relevance and utility of tags present in stackexchange_tags.tag_description (surrounded by square brackets).
      • Can a reliable relationship be established between them and the parent tag?
  • Build modelled controlled vocabulary for software engineering domain. #TODO

Lower Priority:

  • Design mechanism to disambiguate single/few-letter domain words from non-domain instances. #TODO
<style> todo { background-color: Yellow; color: SteelBlue } recurrent { background-color: Gold; color: SteelBlue } refactor { background-color: SpringGreen; color: DarkGreen } done { background-color: Green; color: PaleGreen } test { background-color: Coral; color: DarkRed } fixme { background-color: Crimson; color: white } focus { background-color: DeepSkyBlue; color: MediumBlue } reqspec { background-color: MediumPurple; color: Indigo } doubt { background-color: #FF00FF; color: Yellow } wait { background-color: Pink; color: Crimson } blocked { background-color: Crimson; color: Yellow } </style>

gensim-core-concepts's People

Contributors

leobeeson avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.