GithubHelp home page GithubHelp logo

didi / es-fastloader Goto Github PK

View Code? Open in Web Editor NEW
262.0 262.0 91.0 25.41 MB

Quickly build large-scale ElasticSearch indices by using the fault tolerance and parallelism of Hadoop

License: Apache License 2.0

Shell 7.96% Java 85.80% Batchfile 6.24%

es-fastloader's Issues

xpack权限问题

安装es插件后,调用rest uri会提示
"action [indices:append-lucene] is unauthorized for user [elastic]"

es高级版本不能routing算法不能再用2.X的代码了

这里给出ES7.X的代码:
int routingNumShards=MetaDataCreateIndexService.calculateNumRoutingShards(numShards, Version.CURRENT);
int routingFactor = routingNumShards / numShards;

int shardId = Math.floorMod(hash, routingNumShards) / routingFactor;

希望能有所帮助

打包部署运行问题

有几个问题想请教一下,谢谢。

1、ES插件打包问题
wiki/使用文档” 中提到 “修改好ES代码之后,只需要将mr目录拷贝到ES源码的modules目录中,然后执行ES的编译打包命令,就可以获得安装了appendLucene插件的ES安装包了。“
我一开始把ES-Fastloader\mr目录放进去打包,没看到生成jar包;后来感觉不对,就把 ES-Fastloader\plugin 复制到ES源码的modules目录下,编译打包就能看到jar包了: modules\plugin\build\distributions\plugin-6.6.1-SNAPSHOT.jar。
想确认一下是不是使用ES-Fastloader\plugin。

2、ES部署问题
部署ES,得部署修改过源码的ES版本,还是说可以部署没改过源码的ES。

3、关于运行场景
mr任务运行执行hadoop时入参有esTemplate和time,看代码索引名是由这两个部分组成比如最终是 mytable_20200715。
如果我已有的索引名不是这么组成的,比如索引名就叫mytable,是不是就用不了,得重建索引吗?

另外,首页的微信二维码过期了,方便更新一下吗?

请教一个问题,关于nodemanager端本地生成的lucene文件加载到es本地节点之后的操作

您好,我们的业务场景和滴滴的稍有不同,感觉会简单些
暂时我们不会有更新索引的场景,而是每天都新建一份索引,全量刷写数据
我想问下,hadoop fs -get已经把文件按照shard对应关系拉取到es集群本地的节点了
后面如何把lucene的各个文件加载到es集群呢,相当于把这个部分数据加载到一个空索引里面,我理解就是让文件和索引在es端对应起来,不知道这个步骤直接把文件放进索引的存储目录行不行~
还是说要调取lucene的indexWriter这种操作呢,多谢

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.