Comments (8)
@klesh can you please assign this task to me? I have started to work on it
from incubator-devlake.
Hi @sstojak1 , it's a valid refactore, but may I know why the same data keeps been adding to the RAW_PULL_REQUEST_TABLE after each run?
from incubator-devlake.
Hi @Startrekzky I would say that this is because BB Server API doesn't support date query parameter. So you cannot fetch PRs that were updated/created after last import job.
Because of that Devlake is importing all PRs during each job run (check how BB Cloud is not doing that since date query parameter is supported for that tool).
from incubator-devlake.
Based on the information you provided, it seems the collector would benefit from using the simpler ApiCollector since it would purge related records from the raw data table before saving new PR information. This might be more efficient compared to the StatefulApiCollector in this context.
from incubator-devlake.
@klesh That might work! Is there a way to run Devlake locally in debug mode? I'd like to go through the ApiCollector impl to understand its impact on the rest of the steps for importing BB server data...
from incubator-devlake.
Yes, sure. You may follow this guide.
In case you wanna execute specific subtasks, you may go to the backend
folder and run sth like the following:
go run plugins/jira/jira.go -c 2 -b 8 -t "extractWorklogs"
remember to change the plugin name and arguments accordingly.
from incubator-devlake.
@sstojak1 Thanks for the reminder. Done. 🤝
from incubator-devlake.
@klesh ready to be reviewed
from incubator-devlake.
Related Issues (20)
- [Bug][GitHub] wrong pipeline name column if run-name is defined HOT 2
- [Bug][Jira][REST API] Unable to set up Jira connection via API due to token length issue HOT 4
- [Bug][Onboard] Option `enableGraphQL` is disabled by default. HOT 1
- [Bug][GitHub] Missing Pull Request data when changing the Time Range in Sync Policy HOT 1
- [Feature][Jira issue changelogs] Add customized jira account fields to table.accounts HOT 1
- [Bug][Sonarcloud] Error 1406 (22001): Data too long for column 'project_key' at row 1 HOT 7
- [Bug][GitLab] Sometimes Project data are automatically deleted. HOT 3
- [Bug][cicd_pipelines] Fix pipelines with Github HOT 11
- [Bug][Gitlab] v1.0.1-beta2 Data Time Range not working HOT 21
- [Bug][CircleCI Plugin] Only collecting first page of API responses HOT 6
- [Bug][Config UI] AxiosError: Request failed with status code 502 HOT 4
- [Bug][Gitlab] Component and File-Level Metrics not displaying HOT 7
- [Feature][Framework] Reminder before token is expired
- [Question][Data Models] Code Quality / Testing HOT 7
- [Bug][Pagerduty] The PagerDuty connection is pulling incident data but this is not being reflected in metrics. HOT 7
- [Bug][Framework] Projects shouldn't be deleted if it has any connections.
- [Bug][pagerduty] DevLake does not collect PagerDuty incidents HOT 5
- [Feature][PR metrics] Add key timestamps to the `project_pr_metrics` table
- [Feature][GitHub] Hide 'enable GraphQL' in the GitHub connection configuration
- [Question][AzureDevOps] Pipeline run retention HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from incubator-devlake.