Comments (10)
This metric will be very useful for understanding disk usage by sstables per shard, e.g. to watch compaction release space on the fly, and to easily spot data imbalance across shards caused by bad data modelling. Today we have to scan the directory to get that info, which is bad.
This metric sounds useful. How and when do we update the metric? E.g., a background scan at a low frequency (10s), or updating the metric immediately when an sstable is added to / removed from the system. We need to consider file stream and sstables in the upload directory too.
The SSTable manager knows when an sstable is added / closed, so we can easily calculate it on the fly without any scanning. That way it covers all sstables in the system. We can consider only sealed sstables.
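A minimal sketch of the on-the-fly accounting described above, assuming one tracker instance per shard that is notified when an sstable is sealed or removed. The class `shard_sstable_disk_usage` and its hooks `on_sealed` / `on_removed` are hypothetical names for illustration, not the actual SSTable manager interface:

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <unordered_map>

// Hypothetical per-shard tracker; NOT the real SSTable manager API.
// It keeps a running total so no directory scan is ever needed.
class shard_sstable_disk_usage {
    std::unordered_map<std::string, uint64_t> _sealed; // sstable name -> bytes on disk
    uint64_t _total_bytes = 0;                          // the value a metric would export
public:
    // Called when an sstable is sealed; unsealed sstables are not counted.
    void on_sealed(const std::string& name, uint64_t bytes) {
        // Ignore a repeated notification for the same sstable.
        if (_sealed.emplace(name, bytes).second) {
            _total_bytes += bytes;
        }
    }

    // Called when an sstable is removed, e.g. after compaction replaces it.
    void on_removed(const std::string& name) {
        auto it = _sealed.find(name);
        if (it != _sealed.end()) {
            assert(_total_bytes >= it->second);
            _total_bytes -= it->second;
            _sealed.erase(it);
        }
    }

    uint64_t total_bytes() const { return _total_bytes; }
};
```

With this shape, the metric is simply a read of `total_bytes()`, and a compaction that replaces several sstables with one shows up as a drop as soon as the inputs are removed.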
from scylladb.
It doesn't work as a metric, you can't look at tables, you can't get distributions.
Better a virtual table. We could have a virtual table for sstables, or a virtual table for files under data/.
Today we have a bloom filter memory size metric, and you know what percentage of RAM it takes. A disk size metric will let us spot imbalance quickly. I am investigating a problem where this metric would help a lot.
An advantage of a metric over a virtual table is that you can see behavior over time, and I need that for correlating disk usage activity (per shard) with compaction.
For example, right now I am looking at the number of sstables per shard and the bloom filter size per shard. I also need sstable disk size per shard, and I only have a node-wide one. The disk usage metric provided by the OS is suboptimal when we need to correlate it with activity in a single shard.
I think it will be valuable. And I wish we had this metric now.
What if we add both a virtual table and a metric? They're not mutually exclusive, and each has its pros and cons. If I am proven wrong, then we remove the metric.
I put more thought into this and I think the metric is valuable. After all, this issue was opened in the context of an investigation, and the lack of the metric made it much harder for me to understand compaction behavior in a shard. You're right that it doesn't provide distribution per table, but it does provide distribution per shard (which is enough for detecting imbalance as a first step; today you cannot see it through monitoring), and it's the only way to see behavior over time (something a virtual table cannot provide, and which is important for correlation with activity in a shard). @avikivity Do you agree? A virtual table can still be added in addition to the metric.
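To illustrate how a per-shard value is enough to flag imbalance as a first step, here is a rough sketch that compares the fullest shard against the mean. The function name `shard_imbalance_ratio` and the choice of max-over-mean as the measure are assumptions for illustration only, not something monitoring does today:

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <numeric>
#include <vector>

// Hypothetical helper: ratio of the fullest shard to the mean across shards.
// 1.0 means perfectly balanced; larger values mean one shard holds more data.
// Returns 0.0 when there are no shards or no data at all.
inline double shard_imbalance_ratio(const std::vector<uint64_t>& bytes_per_shard) {
    if (bytes_per_shard.empty()) {
        return 0.0;
    }
    const uint64_t total = std::accumulate(bytes_per_shard.begin(),
                                           bytes_per_shard.end(), uint64_t{0});
    if (total == 0) {
        return 0.0;
    }
    const double mean = static_cast<double>(total) / bytes_per_shard.size();
    const uint64_t fullest = *std::max_element(bytes_per_shard.begin(),
                                               bytes_per_shard.end());
    return static_cast<double>(fullest) / mean;
}
```

Sampled over time, the same per-shard values can be lined up against compaction activity in that shard, which is the correlation a point-in-time virtual table query cannot give.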
Hi, is this issue still active? I am new to this project, and would like to take this issue if possible. Thanks!
Thanks @changkhothuychung - it's not actively being worked on - feel free to take this.