Comments (4)
I saw a Twitter conversation today in which Matthew wrote that sentence boundary detection that does not require dependency parsing is on the roadmap for spaCy v2 (https://twitter.com/honnibal/status/860395803826937856).
But if you have a Python module that detects sentence boundaries in raw text, it would be no problem to integrate it into the API as a separate entry point.
from spacy-api-docker.
Hi Johannes,
Thanks for replying. I'm a bit confused: I thought the current version of spaCy already does sentence boundary detection?
"Improved sentence segmentation now included in the latest release. Docs are updated with usage."
I'd like to be able to submit an entire document to the server and have it return a JSON array of all segmented sentences.
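To make the requested output shape concrete, here is a minimal sketch. It is not spaCy's segmenter and not part of this API: `split_sentences` is a hypothetical helper that uses a naive punctuation regex as a stand-in, and a real entry point would presumably iterate over spaCy's `doc.sents` instead.

```python
import json
import re


def split_sentences(text):
    """Naive stand-in splitter (an assumption, not spaCy's algorithm).

    Splits after '.', '!' or '?' when followed by whitespace.
    """
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    # Drop empty strings, e.g. when the input is empty or only whitespace.
    return [part for part in parts if part]


document = "Hello world. How are you? Fine!"
sentences = split_sentences(document)

# The response body the comment asks for: a JSON array of sentences.
payload = json.dumps(sentences)
print(payload)
```

A regex like this will mis-split abbreviations such as "e.g." or "Dr.", which is the reason to delegate the actual segmentation to a proper model.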
from spacy-api-docker.
Same for me. /dep will return a list of annotated tokens, but it would be nice to have something like this:
{
  "sentence": [
    {"index": 0, "dep": [...]},
    {"index": 1, "dep": [...]}
  ]
}
Similar to the JSON result that's provided by the Stanford CoreNLP Server.
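One way to get from a flat token list to that shape could look like the sketch below. Everything here is an assumption for illustration: `group_by_sentence` is a hypothetical helper, and the token dicts and their `sent_start` flag are invented stand-ins for whatever /dep actually returns.

```python
import json


def group_by_sentence(tokens):
    """Group a flat list of token dicts into per-sentence entries.

    Assumes (hypothetically) that each token dict carries a boolean
    "sent_start" flag marking the first token of a sentence.
    """
    sentences = []
    for token in tokens:
        # Open a new sentence at a flagged token, or for the very first token.
        if token.get("sent_start") or not sentences:
            sentences.append({"index": len(sentences), "dep": []})
        # Keep the annotation fields but drop the helper flag.
        entry = {key: value for key, value in token.items() if key != "sent_start"}
        sentences[-1]["dep"].append(entry)
    return {"sentence": sentences}


# Invented sample tokens, standing in for the annotated tokens of two sentences.
tokens = [
    {"word": "Hello", "sent_start": True},
    {"word": ".", "sent_start": False},
    {"word": "Bye", "sent_start": True},
    {"word": ".", "sent_start": False},
]

result = group_by_sentence(tokens)
print(json.dumps(result, indent=2))
```

Keeping the per-sentence entries in a list with an explicit `index` mirrors the proposed format, so clients can address sentences by position without re-deriving boundaries from the tokens.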
from spacy-api-docker.
I agree this is probably a better format when working with the output.
However, I'm currently not using this API in any of my projects, so development has a rather low priority for me.
But if you want to change the format, have a look at this part:
spacy-api-docker/displacy_service/parse.py, lines 27 to 49 in 78585e8
I'm always happy to receive pull requests if you make any changes :)
from spacy-api-docker.