I am:
-
a software engineer
-
a master in computer science
-
from the amazing city of Porto, Portugal
📧 You can reach me at [email protected]
Repository to host the Markup Languages and Document Processing project, a fourth year course @FEUP
I am:
a software engineer
a master in computer science
from the amazing city of Porto, Portugal
📧 You can reach me at [email protected]
I suggest Wednesday afternoon for a rehearsal of the Pitch and live demo.
Make this beauty public
This function is too long and should be split into more readable ones, in doing so, one should also go for the parallelization of the operations, if such is deemed relevant by the group. The function is in the stuns.py
file
in the structure_the_unstructured
a new dataset is always created regardless of being a duplicate, this should be handle by, for instance, performing a hash of all the files in the dataset folder and checking if it already exists, it should also be included as an optional operation by adding it as a boolean value to the argparse options (see the --verbose
case)
This will include looking at how it was working before and how the data it received changed, some html knowledge is required
This involves updating the extract_metrics
method in the sensor
class, if necessary implement it in child classes that call this one. The metrics are the ones in the report and the ones mentioned in the FRAUNHOFER/FH meeting
Sample code:
from bson.binary import JAVA_LEGACY
from bson.codec_options import CodecOptions
from pymongo import MongoClient
from uuid import uuid4
client = MongoClient(...)
db = client.get_database(<DB_NAME>, CodecOptions(uuid_representation=JAVA_LEGACY)) #DB_NAME should be demdata_db in our case
id = db.cenas.insert_one({'_id': uuid4(), 'other': '...'}).inserted_id #when inserting new documents into the DB, set the _id property as done here
print(id) #printed as UUID4 (converted by driver), in mongo shell appears as BinData(3, ...)
After the code
A cleanup of each "subproduct" (api, app) 's README.md file is required and also a review for the packages used as only those that are strictly necessary, so:
This task includes a lot of copy paste and should happen after #5
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.