Elansary Mahmoud's Projects
Comparing the performance of Apache Spark to Python's Pandas in popular data analysis tasks such as joins, aggregates and where clauses
The Common Data Model (CDM) is a standard and extensible collection of schemas (entities, attributes, relationships) that represents business concepts and activities with well-defined semantics, to facilitate data interoperability. Examples of entities include: Account, Contact, Lead, Opportunity, Product, etc.
"1 + 1 = 1 or Record Deduplication with Python" Jupyter Notebook
Demonstration of using Files in Repos with Databricks Delta Live Tables
Just another repository
Literature Study
Easy example how to use sliders and dropdown menus. Detailed example in my blog post: https://sylwiamielnicka.com/blog/advanced-plotly-sliders-and-dropdown-menus/
How do we measure the degradation of a machine learning process? Why does the performance of our predictive models decrease? Maybe it is that a data source has changed (one or more variables) or maybe what changes is the relationship of these variables with the target we want to predict. `pydrift` tries to facilitate this task to the data scientist, performing this kind of checks and somehow measuring that degradation.
Search UI. Libraries for the fast development of modern, engaging search experiences.