jncc / datahub
The JNCC datahub - our online web repository of open data and publications.
Upgrade the PoC Search page to use the live ES index. Will have to solve authentication between the Beanstalk app and ES.
What database shall we use?
There are several different types of resources or datasets:
The search results need to paginate.
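A minimal sketch of how pagination could map onto Elasticsearch's `from` and `size` search parameters. The function names and the default page size of 10 are assumptions for illustration, not anything taken from the Datahub code.

```python
def page_to_es_params(page: int, page_size: int = 10) -> dict:
    """Translate a 1-based page number into Elasticsearch `from`/`size` values."""
    page = max(page, 1)  # treat zero or negative page numbers as page 1
    return {"from": (page - 1) * page_size, "size": page_size}


def total_pages(total_hits: int, page_size: int = 10) -> int:
    """Number of pages needed to show `total_hits` results (at least 1)."""
    return max(1, -(-total_hits // page_size))  # ceiling division
```

The returned dictionary can be merged into the search request body, and `total_pages` gives the upper bound for the pager links.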
Test other document types to check performance.
Proof of concept Search / Search Results with basic demo keyword filter by querystring.
static entries with links to individual resource pages
For some reason the NamedProfileCredentialProvider in ElasticsearchService doesn't seem to work.
Some abstracts are written in Markdown. We could support rendering the markdown as HTML.
Test and Live servers.
Update the Lambda function to upsert the record into an Elasticsearch index.
Includes creating the Elasticsearch instance.
Check that it can do basic full-text search.
Meeting for Friday morning - summary of options.
Matt has a new deployment mechanism to replace Felix's - but it needs to be automated.
The full steps to manually deploy are now in the readme.md but this is still a manual process.
Populate with dev data
Questions:
These questions are now less relevant:
The home page is a search page.
...in format from the catalog/inventory and filter (note tweaking needed for multiple resources not viewable in UI, include DOI record- type doi into search to bring it up)
Design notes / discussion for the main Datahub Asset class.
We just need single keyword search by URL and from the asset page keyword, for starters.
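A sketch of single keyword search driven by the URL, assuming the keyword arrives in a query string parameter. The parameter name `keyword` and the index field name `keywords` are hypothetical; the real names depend on how the search page and the index are set up.

```python
from urllib.parse import urlparse, parse_qs


def keyword_from_url(url, param="keyword"):
    """Return the first non-blank value of `param` in the URL's query string, or None."""
    values = parse_qs(urlparse(url).query).get(param, [])
    if values and values[0].strip():
        return values[0].strip()
    return None


def keyword_query(keyword, field="keywords"):
    """Build an Elasticsearch `match` query body for one keyword.

    `field` is a hypothetical field name, not the real index mapping.
    """
    return {"query": {"match": {field: keyword}}}
```

When `keyword_from_url` returns `None`, the page would fall back to an unfiltered search.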
Adjust search page to show that it's a keyword search.
It would be nice to have the link to data.gov.uk page if it exists on our resource / asset landing page.
This could be tricky.... The current plan is to (logically at least) publish to the Datahub first, then DGU.
Needs investigation. Could fundamentally affect the publishing mechanism design.
The 'Show more' / 'Show less' implementation was a fairly quick CSS-only solution for long abstracts in the proof-of-concept. It doesn't work well when the abstract is shorter than 10 lines: the 'Show more' button always shows, and there's excessive whitespace.
This is a definite problem for some records with shorter abstracts.
The plan is to use the main website page and link to it from the footer.
We don't think there's anything else to do really.
The colour hashing code from Topcat needs translating from CoffeeScript into C#. There are some subtleties around 'vocabless' keywords too.
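To illustrate the general idea of colour hashing, here is a sketch that maps a keyword's vocabulary to a stable colour and gives vocabless keywords a neutral one. This is not the Topcat algorithm. The use of MD5, the lightness and saturation values, and the neutral grey are all assumptions chosen for the example.

```python
import colorsys
import hashlib

NEUTRAL_COLOUR = "#999999"  # assumed fallback for keywords without a vocab


def vocab_colour(vocab):
    """Return a deterministic hex colour for a vocab, or a neutral grey if there is none."""
    if not vocab or not vocab.strip():
        return NEUTRAL_COLOUR
    digest = hashlib.md5(vocab.strip().lower().encode("utf-8")).digest()
    hue = int.from_bytes(digest[:2], "big") / 65536.0  # value in [0, 1)
    r, g, b = colorsys.hls_to_rgb(hue, 0.45, 0.6)  # fixed lightness and saturation
    return "#{:02x}{:02x}{:02x}".format(round(r * 255), round(g * 255), round(b * 255))
```

Because the colour depends only on the vocab string, every keyword from the same vocabulary gets the same colour on every page.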
Creates a template and prints out a page based on the page's Topcat ID.
Ideally, use Shippable to push a build on commit.
Tetrienne and MNR staff are sending existing users a copy of their privacy policy as part of GDPR readiness.
They need a list of emails to send the message to, covering both individual and organisational emails.
Please extract from the DB as CSV / Excel and send to Tet.
The website needs revamping which involves reducing the number of webpages. The data, such as publications and scientific documents, from these pages need to be stored, viewable, and searchable by the public from one central place.
How should we do this (and should we?) in the Search page URL, so that e.g. website pages can link to it?
We need to understand more about how Elasticsearch works, whether this is possible, and then whether it is desirable.
The alternative is to have our own search parameters much like Topcat.
We don't want to appear to accidentally claim ownership of something which isn't ours. The Topcat label of Responsible Organisation / Author is not appropriate. Something like:
Organisation
JNCC Distributor
[email protected]
We need a queue to handle the website republish.
Make an HTTP API which accepts a JSON record and upserts it into a Dynamo table.
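A sketch of the upsert logic behind such an endpoint, with an in-memory class standing in for the Dynamo table so the example runs without AWS. The `id` key, the response shape and the class name are assumptions for illustration. In DynamoDB itself, a PutItem call replaces any existing item with the same primary key, which is the behaviour the stand-in mimics.

```python
import json


class InMemoryTable:
    """Hypothetical stand-in for a Dynamo table keyed on 'id'."""

    def __init__(self):
        self.items = {}

    def put_item(self, item):
        # Insert the item, or replace an existing item with the same id.
        self.items[item["id"]] = item


def handle_upsert(body, table):
    """Validate a JSON request body and upsert it into the table."""
    try:
        record = json.loads(body)
    except json.JSONDecodeError:
        return {"statusCode": 400, "body": "invalid JSON"}
    if not isinstance(record, dict) or not isinstance(record.get("id"), str) or not record["id"]:
        return {"statusCode": 400, "body": "record needs a string 'id'"}
    created = record["id"] not in table.items
    table.put_item(record)
    return {"statusCode": 201 if created else 200, "body": json.dumps({"id": record["id"]})}
```

Posting the same `id` twice leaves a single item in the table, holding the most recent version of the record.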
Implement the Search page.
Complete the implementation of a command line tool for Creating, Populating and Deleting index entries.
PDF search needs more 'proving' asap.