deercoder / hadoop-practice Goto Github PK
View Code? Open in Web Editor NEWHadoop demo, my final project for advanced database
Home Page: http://deercoder.github.io/hadoop-practice/
License: GNU General Public License v2.0
Hadoop demo, my final project for advanced database
Home Page: http://deercoder.github.io/hadoop-practice/
License: GNU General Public License v2.0
For hadoop's reduce(), here I just use empty(which means number zero) to output one collection. For example, if there is a city that satisfy the requirement, I just output (city, 0), as reduce() or map() both needs to operate as a tuple. But as a solution it's actually not so good.
But I think it's not elegant, there must be method to eliminate the zero so that there is only the city list for ex1.
Problem Set 1: output the cities where the populations are more than 300,000.
Problem: I will output the city name as well as an 0
behind it.
These two projects are not elegant, many of codes are poorly written and contains some format error, I need to rewrite them in good way...
In ex1, I output the city name, but it seems that the format of hadoop don't support unicode, so my solution will display abnormally, this is a bug.
And I think hadoop should be good at dealing this issue, since many data contains regional code and characters in different languages.
Leave a mark here and will solve it soon.
ex3 doesn't actually output all the countries that is English, it outputs ALL the countries. We have to add other judge conditions.
as title shows, wrong coding for the problem set, don't meet the requirements.
It's quite strange that I can execute the map() function, no error is there, and there is also something output there, but it seems that our map() function is actually not called, since at the very first beginning, log is not output.
Then why this bug happens? And why there is no error?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.