GithubHelp home page GithubHelp logo

harishkamathuk / harishkamath-needle Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 0.0 1.69 MB

Automatically exported from code.google.com/p/harishkamath-needle

PHP 99.52% Shell 0.05% CSS 0.39% ApacheConf 0.01% JavaScript 0.01% Batchfile 0.04%

harishkamath-needle's People

Watchers

 avatar

harishkamath-needle's Issues

Write all messages to a log file

What is the feature that is being added to the Parking Log?

Currently, all messages are printed to the script - this needs to go to a log 
file.

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

This is critical as it is easier to review a log file rather than printing out 
to the screen.

Please provide any additional information below. Links, resources etc.

Use default CakeLog features - should be a quick thing to implement.

Original issue reported on code.google.com by harishkamath on 26 Apr 2014 at 2:08

Email from needle application

What is the feature that is being added to the Parking Log?

Email functionality in the needle application

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Nice to have - this is existing behaviour in the scrape application as it 
emails a CSV file with the NAVs


Please provide any additional information below. Links, resources etc.


Original issue reported on code.google.com by harishkamath on 28 Apr 2014 at 5:30

Convert needle application into a Plugin

What is the feature that is being added to the Parking Log?
Make the application needle as a Plugin for CakePHP application

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?
This is a high priority as the ultimate goal is to integrate this into the main 
pman application.

Please provide any additional information below. Links, resources etc.
See CakePHP documentation on how to build a plugin
Things to look out:

- decouple the application to be re-usable
- how to incorporate db dependencies



Original issue reported on code.google.com by harishkamath on 22 Apr 2014 at 11:44

Associate multiple needles with a URL (to search haystack)

What is the feature that is being added to the Parking Log?

Ability to associate multiple needles with a URL. This will allow the 
application to search for different text in the haystack. Store the results in 
the separate location for future reference

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?
It is a show stopper - will need to prioritised


Please provide any additional information below. Links, resources etc.


Original issue reported on code.google.com by harishkamath on 22 Apr 2014 at 9:00

The Scrapable behaviour needs better HttpSocket implementation

What is the feature that is being added to the Parking Log?

The current implementation is very basic. Need the ability to post data if 
required. Input parameter should be better handled.


How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Nice to have - can be prototype of input argument processing for custom 
functions


Please provide any additional information below. Links, resources etc.




Original issue reported on code.google.com by harishkamath on 23 Apr 2014 at 9:02

URL was not returning valid haystack

What steps will reproduce the problem?
1. Fetch Haystack
2. Mine for Needle
3. No valid NeedleHit found - scrapped data was invalid

What is the expected output? What do you see instead?
Expect to view NAV and IMA Sector for the URLs in question - no data retrieved.

What version of the product are you using? On what operating system?


Please provide any additional information below.


Original issue reported on code.google.com by harishkamath on 27 Apr 2014 at 11:36

Remove special UTF8 characters from needles or haystacks

What is the feature that is being added to the Parking Log?

Build a Helper for the application in general. This can contain function not 
available in CakePHP. For example converting UTF strung into ASCII that is not 
removed by simple string function. 

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

This is critical as it will ensure that we use the needle found for the plan 
application.

Please provide any additional information below. Links, resources etc.


Original issue reported on code.google.com by harishkamath on 23 Apr 2014 at 8:51

The "mine" function should have a "less" greedy option

What is the feature that is being added to the Parking Log?

Currently, the default behaviour of the "mine function is to look for first 
instance of the needle start and needle end values. The function should have 
flag that indicate whether the function call should be greedy or not i.e. it 
should return as much (greedy = true) or as little (greedy=false) feature.

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Nice to have. This will ensure that there are more relevant needles that can be 
created. For example, the ISIN needle does not work as the word is present 
twice on the Morningstar website and the function returns a lot more data than 
it should.


Please provide any additional information below. Links, resources etc.


Original issue reported on code.google.com by harishkamath on 27 Apr 2014 at 11:44

Filtering should be used when processing scraped HTML

What is the feature that is being added to the Parking Log?

Since we are scraping various website, it makes sense to scrap all the HTML 
from the URL being scrapped.

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Often websites have complex HTML and JS code that is not required to be stored 
in the database - something that can strip off the HTML will be useful.


Please provide any additional information below. Links, resources etc.

http://book.cakephp.org/2.0/en/core-utility-libraries/sanitize.html

http://htmlpurifier.org/


Original issue reported on code.google.com by harishkamath on 26 Apr 2014 at 2:03

phpUnit integration with needle

What is the feature that is being added to the Parking Log?

Good practice and get to learn more about unit testing

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Nice to have - personal knowledge


Please provide any additional information below. Links, resources etc.


Original issue reported on code.google.com by harishkamath on 22 Apr 2014 at 8:46

Input all morningstar URLs in the urls table

What is the feature that is being added to the Parking Log?

Populate the data required for BAU use

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Critical

Please provide any additional information below. Links, resources etc.


Original issue reported on code.google.com by harishkamath on 25 Apr 2014 at 6:17

Create Console application using needle to enable it to run in the background

What is the feature that is being added to the Parking Log?

Create a CakePHP Console application to run the screen scrapper

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

This is critical task in order to move away from GoDaddy.


Please provide any additional information below. Links, resources etc.

Note that the UTF-8 character bug.


Original issue reported on code.google.com by harishkamath on 22 Apr 2014 at 6:23

Create a config file for the application for constants and config values

What is the feature that is being added to the Parking Log?

This is to ensure that the application can ported easily using config files

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Critical as there are many hard coded values in the code base today


Please provide any additional information below. Links, resources etc.

See Config file section in the CakePHP documentation

Original issue reported on code.google.com by harishkamath on 25 Apr 2014 at 5:21

Haystack associate with NeedleHit is not stored in the database

What steps will reproduce the problem?
1. Mine any Haystack for the needles associate with Url (for Haystack)
2. No hatstack reference is stored in the database before saving the Needle Hit.

What is the expected output? What do you see instead?
The listing should display the Haystack id

What version of the product are you using? On what operating system?
v1.0.0-alpha

Please provide any additional information below.


Original issue reported on code.google.com by harishkamath on 27 Apr 2014 at 10:56

Create task scheduler task to retrieve data from Morningstar

What is the feature that is being added to the Parking Log?

Create task in task scheduler to run everyday to retrieve the data from 
Morningstar

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Critical as this will ensure that we can turn the Godaddy scrape application in 
the future.

Please provide any additional information below. Links, resources etc.

N/A

Original issue reported on code.google.com by harishkamath on 26 Apr 2014 at 7:57

Export the data to a CSV file for import into Quicken

What is the feature that is being added to the Parking Log?

Currently, the data is extracted into CSV file from the scrape application for 
direct import into Quicken

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Critical feature as this is the purpose of the application - however, this 
requires the pman application to be set up.


Please provide any additional information below. Links, resources etc.


Original issue reported on code.google.com by harishkamath on 28 Apr 2014 at 5:31

Use database login for created and modified by fields

What is the feature that is being added to the Parking Log?

Currently this is an input parameter - make this hardcoded non-editable value.

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

Nice to have but required for first release.


Please provide any additional information below. Links, resources etc.


Original issue reported on code.google.com by harishkamath on 27 Apr 2014 at 6:48

Hosting of the website on external site

What is the feature that is being added to the Parking Log?

Hosting the needle CakePHP application at an external site - to provided 
redundancy.

Options: AWS 

http://docs.aws.amazon.com/elasticbeanstalk/latest/dg/create_deploy_PHP_cakePHP.
html

How critical is it to the function of the application i.e. is it a show
stopper or nice-to-have?

This is a nice to have. It will also give exposure to these external services 
provided by Amazon.


Please provide any additional information below. Links, resources etc.

- https://aws.amazon.com/documentation/elasticbeanstalk/



Original issue reported on code.google.com by harishkamath on 22 Apr 2014 at 11:29

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.