Comments (1)
I missed a detail: While the metadata items in the file above are what Aleph accepts, the items you can pass from the aleph_emit op in Memorious are actually limited by these two methods: https://github.com/alephdata/memorious/blob/main/memorious/operations/aleph.py#L14-L47
As far as I can see, items supported by the Aleph Ingest API but not in Memorious are:
- authored_at
- date
- generator
- mime_type
- summary
It's all a bit confusing, to be honest, because these metadata items also do not always map 1:1 to FtM properties. I guess that is a relic of the fact that documents were a separate concept (and not entities) in the first versions of Aleph.
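As a rough illustration of the list above, here is a minimal sketch that checks a metadata dict for the five keys I listed before you hand it to aleph_emit. It does not call Memorious or Aleph at all; the helper name `split_metadata` and the sample values are made up for the example, and the key set is only what I found by reading the code, so treat it as an assumption rather than an official list.

```python
# Keys that (per the list above) the Aleph ingest API accepts
# but the Memorious aleph_emit operation does not pass along.
# Assumption: taken from reading the linked source, not from any official spec.
NOT_PASSED_BY_MEMORIOUS = frozenset(
    {"authored_at", "date", "generator", "mime_type", "summary"}
)


def split_metadata(meta):
    """Split a metadata dict into (remaining, flagged).

    `flagged` holds the keys from the list above; `remaining` holds
    everything else. Hypothetical helper, not part of Memorious.
    """
    flagged = {k: v for k, v in meta.items() if k in NOT_PASSED_BY_MEMORIOUS}
    remaining = {k: v for k, v in meta.items() if k not in NOT_PASSED_BY_MEMORIOUS}
    return remaining, flagged


if __name__ == "__main__":
    # Sample values are invented for illustration.
    sample = {"title": "Annual report", "summary": "Short abstract", "date": "2020-01-01"}
    remaining, flagged = split_metadata(sample)
    print("would likely be ignored:", sorted(flagged))
```

Something like this could run in a custom step before the emit stage to log which values will not reach Aleph.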