Feature request Unless I missed it, there doesn't seem to be a way

In your case, you are using schemas wrong way. <p dir="

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Reset increment about mimesis HOT 6 CLOSED

rgoubet commented on June 23, 2024

Reset increment

from mimesis.

Comments (6)

lk-geimfari commented on June 23, 2024

Hi! Actually, there is an accumulator argument for such cases: https://mimesis.name/en/master/api.html#mimesis.Numeric.increment

Here is a usage example:

>>> numeric.increment()
1
>>> numeric.increment(accumulator="a")
1
>>> numeric.increment()
2
numeric.increment(accumulator="a")
2
>>> numeric.increment(accumulator="b")
1
>>> numeric.increment(accumulator="a")
3

from mimesis.

lk-geimfari commented on June 23, 2024

In your case, you are using schemas wrong way.

Instead of doing this:

for i in range(0,5):
    data = schema.create(5)
    print(data[0]['id'])

Do this:

for i in schema.create(5):
    print(i['id'])

from mimesis.

rgoubet commented on June 23, 2024

In your case, you are using schemas wrong way.

In my code example, I'm trying to create 5 fullfilled schemas (that I could then export 5 times) based on the same logical schema. And here, I cannot use a new accumulator every time, unless I instantiate a new Schema object every time.

from mimesis.

lk-geimfari commented on June 23, 2024

@rgoubet Sorry, I don't get the idea. Can you, please, illustrate it on example?

from mimesis.

rgoubet commented on June 23, 2024

My use case is that I want to create multiple, large random data sets in Excel files (generated with openpxl) for stress test purposes. So, let's say I want to create 5 files with 1 million rows each (I use 4 columns for readability, while in practice I get 30):

from mimesis import Field, Schema
from openpyxl import Workbook

_ = Field()

schema = Schema(schema=lambda: {
    "id": _('increment'),
    "timestamp": _('datetime'),
    'version': _('version'),
    'e-mail': _('person.email', domains=['argenx.com']),
    'token': _('token_hex'),
}

Now, I'll run a loop for each file, and use the iterator to preserve memory:

for i in range(0,5):
    wb = Workbook(write_only=True)
    ws = wb.create_sheet()
    for ix, v in enumerate(schema.iterator(1_000_000)):
        if ix==0:
            ws.append(list(v.keys())) # write headers
        else:
            ws.append(list(v.values())) # write data
    xl_file = os.path.join(path, f'data{str(i).zfill(3)}.xlsx')
    wb.save(xl_file)
    wb.close()

Now, it's all good, except that the id column increment continues in each file instead of restarting from 1. In my case, that could have been an issue as it can then become a larger number than I would want for the data type I want (turned out ok in the end).

As I said, maybe I missed something, but it would be nice to have a reset option (e.g. in the create and iterator methods) for the increments. Not critical at all, though.

from mimesis.

stale commented on June 23, 2024

This issue has been automatically marked as stale because it has not had activity. It will be closed if no further activity occurs. Thank you for your contributions.

from mimesis.

Reset increment about mimesis HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs