rosedblabs / wal Goto Github PK


199.0 4.0 32.0 72 KB

Write Ahead Log for LSM or bitcask storage, designed to optimize the random io workloads.

License: Apache License 2.0

Go 100.00%

database kv-store lsm-tree storage bitcask write-ahead-log

wal's Issues

Who is using WAL?

Calling all WAL (Write Ahead Log) users!

Share a brief description of your project or use case, how WAL benefits you, and any challenges you have encountered.

Your insights will strengthen our community and inspire others.

Comment below with:

Project overview
In what scenario do you use WAL?
What new features do you expect from WAL?

Wrong comments

wal/segment.go

Lines 32 to 34 in 5178372

// Checksum  Type  Length
//    4       2      1
chunkHeaderSize = 7

Type and Length should be swapped in the comment.

According to

wal/segment.go

Lines 224 to 232 in 5178372

// Length 2 Bytes index:4-5
binary.LittleEndian.PutUint16(buf[4:6], uint16(dataSize))
// Type 1 Byte index:6
buf[6] = chunkType
// data N Bytes index:7-end
copy(buf[7:], data)
// Checksum 4 Bytes index:0-3
sum := crc32.ChecksumIEEE(buf[4:])
binary.LittleEndian.PutUint32(buf[:4], sum)
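
The layout the code actually writes can be sketched as a small round trip. This is a minimal illustration that follows the quoted lines: checksum in bytes 0-3, length in bytes 4-5, type in byte 6. The names encodeChunk and decodeChunk are hypothetical and are not taken from the repository.

```go
package main

import (
	"encoding/binary"
	"errors"
	"fmt"
	"hash/crc32"
)

// chunkHeaderSize is 4 (checksum) + 2 (length) + 1 (type) bytes.
const chunkHeaderSize = 7

// encodeChunk (hypothetical name) lays out a chunk as
// checksum | length | type | payload, matching the quoted code.
func encodeChunk(chunkType byte, data []byte) []byte {
	buf := make([]byte, chunkHeaderSize+len(data))
	// Length: 2 bytes at index 4-5.
	binary.LittleEndian.PutUint16(buf[4:6], uint16(len(data)))
	// Type: 1 byte at index 6.
	buf[6] = chunkType
	// Payload: index 7 to the end.
	copy(buf[7:], data)
	// Checksum: 4 bytes at index 0-3, covering length, type and payload.
	binary.LittleEndian.PutUint32(buf[:4], crc32.ChecksumIEEE(buf[4:]))
	return buf
}

// decodeChunk (hypothetical name) reverses encodeChunk and verifies the checksum.
func decodeChunk(buf []byte) (byte, []byte, error) {
	if len(buf) < chunkHeaderSize {
		return 0, nil, errors.New("chunk shorter than header")
	}
	length := int(binary.LittleEndian.Uint16(buf[4:6]))
	end := chunkHeaderSize + length
	if end > len(buf) {
		return 0, nil, errors.New("chunk length exceeds buffer")
	}
	if crc32.ChecksumIEEE(buf[4:end]) != binary.LittleEndian.Uint32(buf[:4]) {
		return 0, nil, errors.New("checksum mismatch")
	}
	return buf[6], buf[7:end], nil
}

func main() {
	buf := encodeChunk(1, []byte("abc"))
	chunkType, data, err := decodeChunk(buf)
	fmt.Println(chunkType, string(data), err)
}
```

Reading the length before the type in decodeChunk is what makes the header comment order matter: a reader who trusts "Checksum Type Length" would look for the type at byte 4.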

Privatize segment's functions

This is just my OCD kicking in. Nothing outside the package can actually reach the segment's functions through the WAL struct, since the segment field is itself private, but should its functions be private as well?

Add NewReaderWithStart() to wal struct

Use case:
We are planning to build snapshots of a service with the WAL project. The service needs to stop all write events and get the latest chunk position from the WAL instance. When the service experiences an outage or downtime, it can rebuild its state from the snapshot file and replay the events written since that chunk position. Could we add a LastChunkPosition function to the WAL struct?

Something like:
func (wal *WAL) LastChunkPosition() (*ChunkPosition, error)

Support batch writes

Support Data Compression

Snappy
https://github.com/golang/snappy
ZSTD
https://github.com/DataDog/zstd

An unreachable if branch

wal/segment.go

Line 225 in 8de9190

if end > dataSize {

After the first if, chunkSize <= leftSize, so end = dataSize - leftSize + chunkSize <= dataSize. The second if branch is therefore unreachable.

if chunkSize > leftSize {
	chunkSize = leftSize
}

var end = dataSize - leftSize + chunkSize
if end > dataSize {
	end = dataSize
}
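
The claim can be checked with a small brute-force sketch. It assumes 0 <= leftSize <= dataSize, which should hold if leftSize counts the bytes of data still to be written. The helper names chunkEnd and exceedsDataSize are hypothetical.

```go
package main

import "fmt"

// chunkEnd mirrors the quoted arithmetic, without the second if branch.
func chunkEnd(dataSize, leftSize, chunkSize int) int {
	if chunkSize > leftSize {
		chunkSize = leftSize
	}
	return dataSize - leftSize + chunkSize
}

// exceedsDataSize reports whether any input up to limit makes end > dataSize,
// under the assumption 0 <= leftSize <= dataSize.
func exceedsDataSize(limit int) bool {
	for dataSize := 0; dataSize <= limit; dataSize++ {
		for leftSize := 0; leftSize <= dataSize; leftSize++ {
			for chunkSize := 0; chunkSize <= limit; chunkSize++ {
				if chunkEnd(dataSize, leftSize, chunkSize) > dataSize {
					return true
				}
			}
		}
	}
	return false
}

func main() {
	fmt.Println(exceedsDataSize(64))
}
```

A search over small values is no proof, but it agrees with the inequality above: once chunkSize is clamped to leftSize, end cannot pass dataSize.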

Couple of questions

Hi, I'm thinking of using this to increase the rate of instrument transactions that we can process. With a local WAL I can increase throughput, because the requests to the database can be processed in a worker.

So reading this:

	// Sync is whether to synchronize writes through os buffer cache and down onto the actual disk.
	// Setting sync is required for durability of a single write operation, but also results in slower writes.
	//
	// If false, and the machine crashes, then some recent writes may be lost.
	// Note that if it is just the process that crashes (machine does not) then no writes will be lost.
	//
	// In other words, Sync being false has the same semantics as a write
	// system call. Sync being true means write followed by fsync.
	Sync bool

I'm a little bit confused. If there's a fatal crash in the process, how will writes not be lost? If they're stored in a buffer in memory before fsync, how are those writes recovered?

Second question: what happens if I'm simultaneously writing to and reading from (and then deleting from) the WAL from different threads?
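
One way to read the quoted comment, assuming the library hands every write straight to the OS with a write system call and keeps no user-space buffer of its own: once write returns, the bytes live in the kernel's page cache rather than in the process's memory. A process crash does not discard the page cache, so the kernel still flushes them to disk later. Only a machine crash or power loss before that flush loses them, and fsync closes that window. The sketch below shows the two modes with plain os.File calls. The names appendRecord and demo are hypothetical, and this is not the library's code.

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

// appendRecord writes data to f. With sync=false the data is only guaranteed
// to have reached the OS page cache; with sync=true it is also fsynced.
func appendRecord(f *os.File, data []byte, sync bool) error {
	if _, err := f.Write(data); err != nil {
		return err
	}
	if sync {
		// Forces the kernel to flush the file's dirty pages to stable storage.
		return f.Sync()
	}
	return nil
}

// demo appends one record in each mode to a temporary file and reads it back.
func demo() (string, error) {
	dir, err := os.MkdirTemp("", "wal-sync-demo")
	if err != nil {
		return "", err
	}
	defer os.RemoveAll(dir)

	path := filepath.Join(dir, "demo.log")
	f, err := os.OpenFile(path, os.O_CREATE|os.O_WRONLY|os.O_APPEND, 0o644)
	if err != nil {
		return "", err
	}
	defer f.Close()

	// Survives a process crash, but not necessarily a machine crash.
	if err := appendRecord(f, []byte("first\n"), false); err != nil {
		return "", err
	}
	// Survives a machine crash as well, at the cost of a slower write.
	if err := appendRecord(f, []byte("second\n"), true); err != nil {
		return "", err
	}

	got, err := os.ReadFile(path)
	return string(got), err
}

func main() {
	got, err := demo()
	fmt.Printf("%q %v\n", got, err)
}
```

Both records are readable immediately in either mode, because reads are served from the same page cache. The difference only shows up after a power failure, which a sketch like this cannot demonstrate.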

I use:

w.WAL.Write(b)

to write, and:

reader := w.WAL.NewReader()
for {
	val, pos, err := reader.Next()
	if err == io.EOF {
		break
	}
	fmt.Println(string(val))
	fmt.Println(pos) // get position of the data for next read
	w.ch <- val
}

to read. Does reader := w.WAL.NewReader() return all the segments up until the point in time that the function is called? I think it does, looking at:

	if segId == 0 || wal.activeSegment.id <= segId {
		reader := wal.activeSegment.NewReader()
		segmentReaders = append(segmentReaders, reader)
	}

and then:

func (seg *segment) NewReader() *segmentReader {
	return &segmentReader{
		segment:     seg,
		blockNumber: 0,
		chunkOffset: 0,
	}
}

There seem to be 0 chunks in the newly created reader, so it doesn't process any messages in there?

Also, what's the safest way to delete so that I never process a message twice? It isn't the end of the world if I do, as long as the order is chronological. It just costs time.

I can work it out with sufficient testing, but I figured it may be worth asking here.

Thank you in advance 🧡

// Open opens a WAL with the given options.
// It will create the directory if not exists, and open all segment files in the directory.
// If there is no segment file in the directory, it will create a new one.
func Open(options Options) (*WAL, error) {
	if !strings.HasPrefix(options.SegmentFileExt, ".") {
		return nil, fmt.Errorf("segment file extension must start with '.'")
	}
	if options.BlockCache > uint32(options.SegmentSize) {
		return nil, fmt.Errorf("BlockCache must be smaller than SegmentSize")
	}
        ....
}

Read Data Bug

To reproduce:

func TestSegment_Write_LargeSize(t *testing.T) {
	t.Run("32KB-10000", func(t *testing.T) {
		testSegmentReaderLargeSize(t, 32*blockSize, 7000)
	})
}

func testSegmentReaderLargeSize(t *testing.T, size int, count int) {
	dir, _ := os.MkdirTemp("", "seg-test-reader-ManyChunks_large_size")
	os.MkdirAll(dir, os.ModePerm)
	cache, _ := lru.New[uint64, []byte](5)
	seg, err := openSegmentFile(dir, ".SEG", 1, cache)
	assert.Nil(t, err)
	defer func() {
		_ = seg.Remove()
	}()

	positions := make([]*ChunkPosition, 0)
	bytes1 := []byte(strings.Repeat("W", size))
	for i := 1; i <= count; i++ {
		pos, err := seg.Write(bytes1)
		assert.Nil(t, err)
		positions = append(positions, pos)
	}

	for i, pos := range positions {
		val, err := seg.Read(pos.BlockNumber, pos.ChunkOffset)
		assert.Nil(t, err)
		if !bytes.Equal(bytes1, val) {
			t.Log(i)
			t.Log(len(val))
			break
		}
	}
}

First, change the segment's Size function like this to avoid another problem:

func (seg *segment) Size() int64 {
	size := int64(seg.currentBlockNumber) * int64(blockSize)
	return size + int64(seg.currentBlockSize)
}

But the bug still exists.
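
The issue does not say what the other problem is. One plausible reading, assuming the block counters are uint32 and blockSize is 32 KB, is that the multiplication overflows 32 bits before the result is widened: the test writes roughly 7000 x 32 blocks, which is about 7 GB. The sketch below compares the two orders of conversion. The names sizeNarrow and sizeWide are hypothetical.

```go
package main

import "fmt"

// blockSize is assumed to be 32 KB, as the test's "32*blockSize" suggests.
const blockSize = 32 * 1024

// sizeNarrow multiplies in uint32 first, so the product wraps at 4 GiB.
func sizeNarrow(blockNumber, currentBlockSize uint32) int64 {
	return int64(blockNumber*blockSize + currentBlockSize)
}

// sizeWide widens each operand to int64 before multiplying, as in the
// Size function proposed in the issue.
func sizeWide(blockNumber, currentBlockSize uint32) int64 {
	return int64(blockNumber)*int64(blockSize) + int64(currentBlockSize)
}

func main() {
	// 7000 writes of 32 blocks each, ignoring chunk header overhead.
	const blocks = 7000 * 32
	fmt.Println(sizeNarrow(blocks, 0), sizeWide(blocks, 0))
}
```

If the fields are already 64-bit in the real code, this overflow does not apply and the sketch only illustrates why the conversion order matters.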

	// Length 2 Bytes index:4-5
	binary.LittleEndian.PutUint16(buf[4:6], uint16(dataSize))
	// Type 1 Byte index:6
	buf[6] = chunkType
	// data N Bytes index:7-end
	copy(buf[7:], data)
	// Checksum 4 Bytes index:0-3
	sum := crc32.ChecksumIEEE(buf[4:])
	binary.LittleEndian.PutUint32(buf[:4], sum)
