Comments (10)
Hi @dmzio, @thomwolf can chime in, but yes we are working on releasing the (pyTorch powered) training workflow for this model.
Subscribe to get updates!
from neuralcoref.
🎉🎉 Just released v2.0 🎉🎉
Here's a blogpost detailing our work: https://medium.com/huggingface/how-to-train-a-neural-coreference-model-neuralcoref-2-7bb30c1abdfe
We can help anyone who would like to train on a new language.
Feedback appreciated.
from neuralcoref.
Hi @julien-c, I was thinking of the ANCOR corpus [1] for French. It's > 400k words and encodes entities, their properties and anaphoric relations. Do you think it would do the trick?
[1] https://hal.archives-ouvertes.fr/hal-01016562/document
from neuralcoref.
Hello @dmzio and @thomwolf, any update on the training code? The library is really interesting, unfortunately without the ability to train a model it'll remain English-only and many of us can't use that at all. Congrats on the lib though, it's very cool.
from neuralcoref.
Hi @bolaft, quick question – which datasets would you use the training code on? (there are not that many coreference datasets in our experience, so we're super interested of the ones you know about)
from neuralcoref.
Updates on this would be interesting. It would be nice to maybe document an abstract schema for how the data should look like, so the community can write specific transformation tools to transform corpora from other languages into something that the pipeline can consume effortlessly.
from neuralcoref.
Thanks for the feedback @dmzio @bolaft @tgalery.
We will be releasing the training code + associated documentation next week.
In the meantime, you can follow our progress on the training branch here: https://github.com/huggingface/neuralcoref/commits/training
from neuralcoref.
How long does it take to train the model (on CPU and on GPU)?
from neuralcoref.
About a day total on GPU
from neuralcoref.
We are now on release v3.0 so I am closing this old issue.
Feel free to open it again (or a new one) if you are experiencing some issues with the new release.
from neuralcoref.
Related Issues (20)
- Wrong average embedding during inference due to a small bug in neuracoref.pyx
- Missing implementation of doc embeddings during inference
- Wrong Mention Type one-hot vectors during training due to a small bug in dataset.py
- Training Dataset Format
- Can't install neuralcoref, keep getting this error: C:\\Program Files (x86)\\Microsoft Visual Studio\\2019\\BuildTools\\VC\\Tools\\MSVC\\14.29.30133\\bin\\HostX86\\x64\\cl.exe' failed with exit code 2 HOT 4
- Error in training without changing anything from the default instructions
- GPU support - cuda 11.1 - TypeError: Unsupported type <class 'numpy.ndarray'>
- (base) C:\Users\sk136\neuralcoref>python -m neuralcoref.train.learn --train ./data/train/ --eval ./data/dev/ facing problem while executing.. this command HOT 1
- Results completely differ from web-demo
- Compatibility with Spacy 3+ HOT 7
- Regarding finetuning neuralcoref
- dels HOT 1
- neuralcoref not supporting python 3.9 version HOT 1
- spacy.strings.StringStore size changed, may indicate binary incompatibility HOT 5
- Dependency Problem HOT 1
- I can't install neuralcoref HOT 8
- Unresolved dependencies?
- Kernel crashes when trying to run demo code HOT 1
- Process finished with exit code -1073741819 (0xC0000005)
- installation failed with HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from neuralcoref.