Requirement:
Input: Each input is a *.html file, which is a downloaded webpage for a faculty member listed on http://www.cs.txstate.edu/Personnel/Faculty Links to an external site.(e.g., http://www.cs.txstate.edu/Personnel/jg66Links to an external site.). Your program only needs to work offline locally on the downloaded input pages.
Output: Each output is a *.txt file, which contains a tabular form similar to the following, with the requested information (italic as shown below) correctly extracted from the corresponding input file.
Name: Ju (Byron) Gao
Education: BS, PhD, Simon Fraser University
Research interests: Data mining, databases, information retrieval
Office: CMAL 311D
Webpage: http://cs.txstate.edu/~jg66