Comments (2)
04-05-15 Update:
- Updated project name to ReCiter.
- Updated Lucene code to index title, journal, keyword field without tokenizing.
- Still need to add an untokenized affiliation field.
- Added a slf4j logger. Outputs the following metadata:
- article pmid
- article title
- article journal title
- article author names
Still need to work on outputting:
- article author affiliation
Outputs precision and recall value for similarity thresholds from 0.1 to 0.9 in increments of 0.1.
Outputs the best similarity threshold out of these 9 values (0.1 to 0.9).
Outputs a CSV-like format to reciter.log file for each similarity threshold.
Note:
- Make sure to have a local db connection (or change the db config in DbConnectionFactory.java) to enable ReCiter to retrieve the gold standard information.
- To run the ReCiterExample.java for a specific person, please change the cwid, first name, middle name, last name in config.properties file in the project workspace.
- The reciter.log file appends any log outputs, so to ensure a clean log, try to delete to file every time you run the ReCiterExample.java file for a particular cwid.
from reciter.
- Beta version of CSV output finished.
- Added AnalysisCSVWriter.java to output CSV output with file name csv_output.csv.
- Added more options to the file config.properties.
a). authorKeywords: keywords which relates to the author.
b). similarityThreshold: cosine similarity
c). coAuthors: co-authors of this author. - Running ReCiterExample.java will produce the csv file with configurations in config.properties.
from reciter.
Related Issues (20)
- Update MeSHterm.json
- First name scoring does not properly match in cases where nameMatchFirstType should be "full-conflictingAllButInitials"
- Failure to score article first name in cases where institutional first name contains space or dash
- Feature Generator by Group API should accept input of an array of person IDs HOT 2
- Fields parameter in Feature Generator not working as expected
- Feature Generator outputs in a single article suggestion pieces of two separate article records HOT 1
- 500 Internal Server Error for _dar7342 HOT 1
- Application returns 500 error if "emails" field is null
- Refactor publication type assignment
- DOI is parsed incorrectly HOT 1
- No documentation on how to use Reciter and the Reciter Pubmed retrieval tool HOT 6
- Create new publication type, "Erratum"
- Add first name likelihood scoring strategy
- Investigate 404 errors
- Switch to using environmental variables
- App throws an error if firstName field is blank
- Output equalContrib as an author level attribute
- Update the way ReCiter handles books HOT 1
- Downweight cases where org unit doesn't match
- Look up candidate records by names of collaborators
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from reciter.