gbif / bid-projects Goto Github PK
View Code? Open in Web Editor NEWManagement and issues surrounding the BID projects
Management and issues surrounding the BID projects
This piece of SQL will retrieve all datasets where the EML(xml) file contains a project ID that begins with the string 'BID'.
SELECT (xpath('./dataset/project/@id', t1.eml))[1]::text AS eml, t1.dk, t1.dataset_title, t1.publisher FROM
The xpath object navigates the xml hierarchy, returns an array and must be cast as text
(
SELECT convert_from(m.content, 'LATIN1')::xml AS eml, d.key AS dk, d.title as dataset_title, o.title AS
The metadata content field is assumed to be LATIN1 and converted to the DB encoding. Otherwise UTF8 breaking characters will trip up the query.
publisher FROM node
JOIN organization o ON node.key = o.endorsing_node_key
JOIN dataset d ON o.key = d.publishing_organization_key
JOIN metadata m ON d.key = m.dataset_key WHERE
d.deleted IS NULL AND encode(m.content, 'escape') !~ '^\357\273\277|^\377\376'
Regex ignores xml with BOM and UTF16LE BOM
)t1
WHERE trim((xpath('./dataset/project/@id', t1.eml))[1]::text) LIKE '%BID%'
http://api.gbif.org/v1/dataset/search?facet=project_id&limit=0&facetLimit=1000
you’d have to iterate and look at the name...
count is the number of datasets
Notice, e.g.
" bid-af2015-0117-sma",
has bad whitespace
and “- “
"name": "- bid-af2015-0004-nac",
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.