GithubHelp home page GithubHelp logo

gxtrobot / bustag Goto Github PK

View Code? Open in Web Editor NEW
3.9K 3.9K 587.0 3.3 MB

a tag and recommend system for old bus driver 给老司机用的一个番号推荐系统

License: MIT License

Python 15.90% Makefile 0.13% JavaScript 78.02% Smarty 5.45% Dockerfile 0.41% Shell 0.07% CSS 0.01%

bustag's People

Contributors

gxtrobot avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

bustag's Issues

请问如何清空未打标的的数据

我打标了一些数据之后发现虽然我喜欢的和不喜欢的都已经进到了对应的分类里,但是剩下未打标的数据都是我觉得一般的,留下来貌似没什么用啊,请问怎样清空这些数据呢?

docker运行出错,报错信息如内所示

/app
dirname:/app/src/bustag/bustag/app
Bustag server starting: version: 0.2.0

CWD: /app
system error
Press Enter to continue ...
Traceback (most recent call last):
File "/usr/local/lib/python3.7/runpy.py", line 193, in _run_module_as_main
"main", mod_spec)
File "/usr/local/lib/python3.7/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/app/src/bustag/bustag/app/index.py", line 221, in
input()
EOFError: EOF when reading a line

win10下载失败

serving on 0.0.0.0:8000 view at http://127.0.0.1:8000
start download
Job "download (trigger: date[2019-09-16 23:39:29 CST], next run at: 2019-09-16 23:39:29 CST)" raised an exception
Traceback (most recent call last):
File "lib\site-packages\apscheduler\executors\base.py", line 125, in run_job
File "bustag\app\schedule.py", line 18, in download
KeyError: 'download.root_path'

没有任何更新和推荐数据

如果不导入任何数据库文件的话,是不是不会自动下载任何数据?我用docker部署上去后,并不显示任何推荐等数据,导入数据库文件后,仍然没有任何更新和推荐,我查看了一下容器的日志,如下
/app,
Bottle v0.12.17 server starting up (using PasteServer())...,
Listening on http://0.0.0.0:8000/,
Hit Ctrl-C to quit.,
,
2019-10-08 03:52:24,828 - bustag - WARNING - classifier.py - recommend ,
no data for recommend ,
2019-10-08 03:52:25,909 - aspider - WARNING - crawling.py - exit_on_empty_queue ,
empty queue, now quit ,
2019-10-08 03:52:25,909 - aspider - WARNING - crawling.py - crawl ,
closing the crawler ,
2019-10-08 03:52:25,909 - aspider - WARNING - crawling.py - work ,
canceling the worker ,
2019-10-08 03:52:25,909 - aspider - WARNING - crawling.py - work ,
canceling the worker ,
2019-10-08 03:52:25,909 - aspider - WARNING - crawling.py - work ,
canceling the worker ,
2019-10-08 03:52:25,909 - aspider - WARNING - crawling.py - work ,
canceling the worker ,

docker运行出错

安装在群晖docker中,一直运行正常,昨天重启群晖后输入
docker run --rm -d -v $(pwd)/data:/app/data -p 8000:8000 gxtrobot/bustag-app
提示错误:
docker: Error response from daemon: Bind mount failed: '/root/data' does not exists.
新建/root/data后
没有错误提示了,但是docker ps没有项目在运行,浏览器无法访问,求解

EOFError: EOF when reading a line

dawnma @ dawnmadeMacBook-Pro in ~/Downloads/bustag [1:46:53]

$ docker run --rm -v $(pwd)/data:/app/data -p 8000:8000 gxtrobot/bustag-app
/app
dirname:/app/src/bustag/bustag/app
Bustag server starting: version: 0.2.0

CWD: /app
system error
Press Enter to continue ...
Traceback (most recent call last):
File "/usr/local/lib/python3.7/runpy.py", line 193, in _run_module_as_main
"main", mod_spec)
File "/usr/local/lib/python3.7/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/app/src/bustag/bustag/app/index.py", line 221, in
input()
EOFError: EOF when reading a line

打标了400个为啥训练模型全是0

已经标记了400个(喜欢与不喜欢之和)
尚有40多个未标记的。然后点击开始训练。控制台输出这个,并且模型准确率、覆盖率均为0。
且推荐页面没有输出任何一个东西

F:\BaiduNetdiskDownload\bustag_win_0.1.1\bustag\bustag\model\classifier.py:36: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples, ), for example using ravel().
F:\BaiduNetdiskDownload\bustag_win_0.1.1\bustag\sklearn\metrics\classification.py:1437: UndefinedMetricWarning: Precision is ill-defined and being set to 0.0 due to no predicted samples.
F:\BaiduNetdiskDownload\bustag_win_0.1.1\bustag\sklearn\metrics\classification.py:1437: UndefinedMetricWarning: F-score is ill-defined and being set to 0.0 due to no predicted samples.
tp: 0, fp: 0
INFO:bustag:tp: 0, fp: 0
fn: 7, tn: 92
INFO:bustag:fn: 7, tn: 92
precision_score: 0.0
INFO:bustag:precision_score: 0.0
recall_score: 0.0
INFO:bustag:recall_score: 0.0
f1_score: 0.0
INFO:bustag:f1_score: 0.0
new model trained
INFO:bustag:new model trained
/do-training
127.0.0.1 - - [09/Sep/2019:13:49:29 +0800] "GET /do-training HTTP/1.1" 200 4375 "http://127.0.0.1:8000/do-training" "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/75.0.3770.100 Safari/537.36"

启动了,打不开

http://127.0.0.1:8000打不开呀?一直在debug

DEBUG:bustag:save tag_item: 4452
save tag: 1160
DEBUG:bustag:save tag: 1160
save tag_item: 4453
DEBUG:bustag:save tag_item: 4453
save tag_item: 4454
DEBUG:bustag:save tag_item: 4454
save tag_item: 4455
DEBUG:bustag:save tag_item: 4455
save tag_item: 4456
DEBUG:bustag:save tag_item: 4456
save tag_item: 4457
DEBUG:bustag:save tag_item: 4457
save tag_item: 4458
DEBUG:bustag:save tag_item: 4458

建议增加arm64 docker镜像

发现兴致勃勃的下好了docker镜像,最后发现只有x86才能用....
standard_init_linux.go:211: exec user process caused "exec format error"

1

6666

0.2.0模型训练不能

点击训练模型后所有数值均为0,命令行提示
\bustag\model\classifier.py:37: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples, ), for example using ravel().
/do-trainin

希望增加已阅和按标签筛选的功能

按标签可以快速筛选出大量自己喜欢和讨厌的数据以快速完成打标

已阅是因为很多人都会快速打标,但实际喜欢的并没有去下载,因此可以加一个已阅标记,在喜欢列表里真的下载过的可以设置为已阅,方便用户管理自己看过和没看过的数据

嗯对数据。。。

请问定时刷新逻辑是什么?

比如我现在已经有3000个样本了。定时刷新再获取1000个,但是30分钟内并没有1000个新的视频发布,比如只有30个,那会往更往前的旧的历史资料里面获取1000个吗?还是新的30个?还是新的30加旧的970个

powershell运行时出错。

输入 docker run --rm -d -e TZ=Asia/Shanghai -e PYTHONUNBUFFERED=1 -v ${PWD}/data:/app/data -p 8000:8000 gxtrobot/bustag-app后,出错。

错误提示如下:
C:\Program Files\Docker\Docker\Resources\bin\docker.exe: Error response from daemon: driver failed programming external connectivity on endpoint heuristic_kilby (22adc3bae8fafcecfc8692577f0dcb92f121bbcc18ff9fee9cb39fe4d4c80d52): Error starting userland proxy: /forwards/expose/port returned unexpected status: 500.

不知道什么原因,第一次运行的时候,该下载的东西都下载好了。bustag.exe窗口没有报错。localhost:8000无法访问。

1.x to 2.0 导入数据不全

2000余条数据导入2.0 之后只剩下24条。
没有报错,导入时命令行显示有2000+条,导入完成后自动更新,最后 ALL DONE。

win10 x64 ver 1903 18362.387

有一部分片子抓不到

比对了下今天抓到的和源站首页展示的,有很大一部分从未见到过。
不知为何bustag没有抓下来,更新规则是默认的30分钟抓300个。
所以对更新抓取逻辑有点好奇。

另外有一部分片子的演员未显示出来。

新版海报变大了好评,建议更大一些,或者增加一个点击放大选项

win 10 启动报错 UnknownTimeZoneError

Exception in thread Thread-1:
Traceback (most recent call last):
File "threading.py", line 926, in _bootstrap_inner
File "threading.py", line 870, in run
File "bustag\app\schedule.py", line 48, in start_scheduler
File "lib\site-packages\apscheduler\schedulers\base.py", line 87, in init
File "lib\site-packages\apscheduler\schedulers\base.py", line 126, in configure
File "lib\site-packages\apscheduler\schedulers\asyncio.py", line 48, in _configure
File "lib\site-packages\apscheduler\schedulers\base.py", line 697, in _configure
File "lib\site-packages\tzlocal\win32.py", line 93, in get_localzone
File "lib\site-packages\tzlocal\win32.py", line 84, in get_localzone_name
pytz.exceptions.UnknownTimeZoneError: 'Can not find timezone '

子路径支持

如现有域名 foo.com,正跑着一个非常正常的网站,这时我希望加一个路径:foo.com/bustag 可以代理到 docker 中的 8000 端口,目前的代码并不支持这种操作,建议支持一下

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.