GithubHelp home page GithubHelp logo

Formatting Abstracts about mat2vec HOT 2 CLOSED

emielke12 avatar emielke12 commented on September 24, 2024
Formatting Abstracts

from mat2vec.

Comments (2)

vtshitoyan avatar vtshitoyan commented on September 24, 2024

In principle, you don't have to do any special formatting. However, in the original paper, we used some pre-processing to reduce the size of the vocabulary and improve tokenization. This is beneficial if the text you are dealing with has to do with materials science/chemistry. You can use the process method here, then join back the tokens and dump it to the text corpus file. Let me know if this answers your question and I will close the issue.

from mat2vec.

emielke12 avatar emielke12 commented on September 24, 2024

Yes this answers my question. Thanks!

from mat2vec.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.