Light

hpdell / luojia1-clawer Goto Github PK

View Code? Open in Web Editor NEW

0.0 2.0 0.0 7 KB

TypeScript 100.00%

luojia1-clawer's Introduction

珞珈一号影像数据爬虫

本仓库是用于按行政区下载珞珈一号影像数据的爬虫。支持断点续爬。没有文件校验，因此有时下载的影像会不可使用。

使用方法

爬虫的运行基于两个配置文件：

爬虫参数配置文件 config.json
影像查询参数配置文件 params.json

爬虫参数

爬虫参数如下所示：

{
    "latestImagingTime": "2018-03-11 14:56:53",     // 上次下载的最后一幅影像的时间
    "username": "您的用户名",   // 用户名
    "password": "您的密码"      // 密码
}

lastestImagingTime 参数，在首次爬取的时候可以设置为 2018 年之前的日期，就可以获取到所有影像。

将 config-sample.json 重命名为 config.json 文件，并填入对应字段的正确值。

影像查询参数

修改 params.json 文件中配置的参数，符合如下模型：

interface QueryParams {
    productLevel: "L2" | "L3" | null;
    level: "province" | "city" | "district";
    zoneNo: number;
}

以广州市为例：

{
    "productLevel": "L2",   // 产品纠正级别，可选 L2 和 L3 ，填 null 表示全部。
    "level": "city",        // 行政区等级，可选 province, city, district，应与 zoneNo 对应。
    "zoneNo": 440100        // 行政区编号，需要从珞珈一号官网上抓包以获取到这个编号。
}

运行爬虫

修改完成后，使用如下命令运行爬虫：

node index.js

下载下来的影像会保存在 data 文件夹中。

其他事项

程序中每次网络请求过后会通过 delay() 函数等待，如果觉得等待时长过长过过短，请自行调整，单位为毫秒。
本次获取的所有影像的列表在 data/image_list_*.csv 文件中，如有文件下载错误，可手动下载。

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs