Comments (4)
Need to add Porter Stem to Lucene to recognize stemming.
from reciter.
Please try first to use the snowball stemmer, available on GitHub here:
https://github.com/snowballstem
As an alternative, Stanford CoreNLP is available here:
http://nlp.stanford.edu/software/corenlp.shtml
Background information on stemming and lemmatization is available here:
http://nlp.stanford.edu/IR-book/html/htmledition/stemming-and-lemmatization-1.html
from reciter.
In phase one, stemming is to be applied to journal title, MeSH major terms, and article title (see Phase One - G in ReCiter Architecture and Data Processing Operations)
In phase two, stemming is to be applied to board certifications and department names (see Phase Two - H in ReCiter Architecture and Data Processing Operations)
from reciter.
Hanumantha has integrated the code into ReCiterCWIDData.java starting at line 164; this code is run by the code for #84.
from reciter.
Related Issues (20)
- Update MeSHterm.json
- First name scoring does not properly match in cases where nameMatchFirstType should be "full-conflictingAllButInitials"
- Failure to score article first name in cases where institutional first name contains space or dash
- Feature Generator by Group API should accept input of an array of person IDs HOT 2
- Fields parameter in Feature Generator not working as expected
- Feature Generator outputs in a single article suggestion pieces of two separate article records HOT 1
- 500 Internal Server Error for _dar7342 HOT 1
- Application returns 500 error if "emails" field is null
- Refactor publication type assignment
- DOI is parsed incorrectly HOT 1
- No documentation on how to use Reciter and the Reciter Pubmed retrieval tool HOT 6
- Create new publication type, "Erratum"
- Add first name likelihood scoring strategy
- Investigate 404 errors
- Switch to using environmental variables
- App throws an error if firstName field is blank
- Output equalContrib as an author level attribute
- Update the way ReCiter handles books HOT 1
- Downweight cases where org unit doesn't match
- Look up candidate records by names of collaborators
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from reciter.