dragonfly

mirror of https://github.com/dragonflydb/dragonfly.git synced 2025-05-11 18:35:46 +02:00

Author	SHA1	Message	Date
Roman Gershman	03b3f86aed	chore: Track db_slice table memory instantly (#3375 ) We update table_memory upon each deletion and insertion of an element. Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-07-24 14:13:08 +03:00
Vladislav	f81a893368	chore(tiering): Range functions + small refactoring (#3207 )	2024-07-22 18:36:11 +03:00
Roman Gershman	7df6771eaa	fix: do not upload offload values on a first hit (#3360 )	2024-07-22 16:23:22 +03:00
Kostas Kyrimis	8a2d6ad1f4	fix: ub in RegisterOnChange and regression tests for big values (#3336 ) * fix replication test flag name for big values * fix a bug that triggers ub when RegisterOnChange is called on flows that iterate over the callbacks and preempt * add a stress test for big value serialization Signed-off-by: kostas <kostas@dragonflydb.io>	2024-07-19 07:03:17 +00:00
Kostas Kyrimis	bfa5df5d6c	feat: add an option to flush serialized entries on threshold limit (#3241 ) * serialize big slots in chunks * allow preemption on large slots * disable big entries serialization for RDB files * add test Signed-off-by: kostas <kostas@dragonflydb.io>	2024-07-18 10:15:41 +00:00
Roman Gershman	b9f8671df9	chore(tiering): add protection against overruning memory budget (#3327 ) chore(tiering): Introduce second chance replacement strategy Introduce hot/cold replacement strategy https://www.geeksforgeeks.org/second-chance-or-clock-page-replacement-policy/ Also, add protection against overruning memory budget Finally, cancel in-flight offloading requests for entries that were looked up. Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-07-18 03:52:43 -04:00
Shahar Mike	18ca61d29b	feat(namespaces): Initial support for multi-tenant (#3260 ) * feat(namespaces): Initial support for multi-tenant #3050 This PR introduces a way to create multiple, separate and isolated namespaces in Dragonfly. Each user can be associated with a single namespace, and will not be able to interact with other namespaces. This is still experimental, and lacks some important features, such as: * Replication and RDB saving completely ignores non-default namespaces * Defrag and statistics either use the default namespace or all namespaces without separation To associate a user with a namespace, use the `ACL` command with the `TENANT:<namespace>` flag: ``` ACL SETUSER user TENANT:namespace1 ON >user_pass +@all ~* ``` For more examples and up to date info check `tests/dragonfly/acl_family_test.py` - specifically the `test_namespaces` function.	2024-07-16 19:34:49 +03:00
Vladislav	22756eeb81	fix(migration): Use transactions! (#3266 ) Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>	2024-07-16 14:06:34 +03:00
Shahar Mike	d7351b315e	refactor: Use `DbContext`, `OpArgs` and `Transaction` to access `DbSlice` (#3311 ) This is a refactor that will put us closer to adding namespaces, see included `docs/namespaces.md`	2024-07-12 08:13:16 +03:00
Roman Gershman	f9ded47c3d	chore: refactoring around tiered storage (#3299 ) * chore: refactoring around tiered storage 1. Renamed ReportXXX callbacks to NotifyXXX 2. Pulled RecordDelete/RecordAdded out of TieredStorage::ShardOpManager. 3. Moved TieredStorage::ShardOpManager functions to to private scope. 4. Streamlined code in TieredStorage::Delete Signed-off-by: Roman Gershman <roman@dragonflydb.io> * fix: Preserve expiry upon uploading external values --------- Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-07-11 14:47:26 +03:00
Roman Gershman	fba902d0ac	fix: properly clean tiered state upon flash (#3281 ) * fix: properly clean tiered state upon flash The bug was around io pending entries that have not been properly cleaned during flush. This PR simplified the logic around tiered storage handling during flush, it always performs the cleaning in the synchronous part of the command. In addition, this PR improves error logging in tests if dragonfly process exits with an error. Finally, a test is added that makes sure pending tiered items are flushed during the flash call. Fixes #3252 --------- Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-07-08 10:43:11 +03:00
Shahar Mike	f28bd93854	fix(cluster): Don't miss keys when migrating slots (#3218 ) In rare cases, the fuzzy cluster migration test detected missing keys. It turns out that the missing keys were skipped at the source side due to contention: * The OnDbChange callback registered and got a `snapshot_id` * It then blocked on a mutex, and could not add itself to the list of callbacks * When the mutex was released, it registered, but it missed all changes that happened between registering (`snapshot_id`) and the moment it registered	2024-06-25 15:41:17 +03:00
Vladislav	4357933775	feat(server): expiry notifications (#3154 ) Adds basic support for keyspace notifications, only Ex Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>	2024-06-24 16:23:40 +03:00
Borys	d75c79ce5c	fix: fix RegisterOnChange methods for journal and db_slice (#3171 ) * fix: fix RegisterOnChange methods for journal and db_slice. Call db_slice and journal callbacks atomically. Made a hack to avoid deadlock during SAVE	2024-06-20 12:37:37 +03:00
Kostas Kyrimis	0edcbc80ef	chore: recommit client tracking (#3136 ) * bring back client tracking * disable all client commands in squashing * allow all client commands in multi/exec block	2024-06-07 15:34:44 +03:00
Roman Gershman	6498f88619	chore: a small cleanup (#3140 ) * chore: a small cleanup No functional changes. --------- Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-06-06 15:16:52 +03:00
adiholden	2a5a53f1d9	Revert "refactor client tracking, fix atomicity, squashing and multi/… (#3122 ) Revert "refactor client tracking, fix atomicity, squashing and multi/exec (#2970)" This reverts commit `b1063f7823`.	2024-06-04 10:34:32 +03:00
Kostas Kyrimis	b1063f7823	refactor client tracking, fix atomicity, squashing and multi/exec (#2970 ) * add partial support for CLIENT CACHING TRUE (only to be used with TRACKING OPTIN) * add OPTIN to CLIENT TRACKING command * refactor client tracking to respect transactional atomicity * fixed multi/exec and disabled squashing with client tracking * add tests	2024-06-03 22:14:30 +03:00
Vladislav	137bd313ef	fix(server): Sync FLUSH with tiering (#3098 ) * fix(server): Sync FLUSH with tiering Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>	2024-05-30 16:50:12 +03:00
Vladislav	68d1a8680c	fix(tiering): Async delete for small bins (#3068 ) * fix(tiering): Async delete for small bins --------- Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>	2024-05-28 12:08:59 +03:00
Vladislav	9e3748421b	fix(tiering): rename v2 + max_file_size (#3004 ) * fix: rename + max_file_size Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>	2024-05-03 21:36:18 +03:00
Vladislav	08983c181f	chore: small tiering fixes (#2966 ) * chore: tiering fixes Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>	2024-05-02 18:00:46 +00:00
Roman Gershman	9bda5b1d4b	chore: another preparation commit to get rid of kv_args in transaction (#2996 ) This changes Entry::Payload to struct instead of variant. Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-05-02 09:59:45 +03:00
Vladislav	82dd05fe30	chore: Remove TieringV1 (#2962 ) Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>	2024-04-26 11:01:05 +03:00
adiholden	d5cd0ed204	fixes for v1.18.0 (#2956 ) * fix server: change table_growth_margin default value --------- Signed-off-by: adi_holden <adi@dragonflydb.io>	2024-04-24 18:21:14 +03:00
Roman Gershman	89b1d7d52a	chore: Introduce ShardArgs as a distinct type (#2952 ) Done in preparation to make ShardArgs a smart iterable type, but currently it's just a wrapper aroung ArgSlice. Also refactored common.{h,cc} into tx_base.{h,cc} In addition, fixed a bug in key tracking, where we wrongly created weak_ref in a shard thread instead of doing this in the coordinator thread. Finally, identified another bug (not fixed yet) where we track all the arguments instead of tracking keys only. Besides this, no functional changes around the moved code. Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-04-24 13:36:34 +03:00
Borys	2230397a12	refactor: add cluster namespace (#2948 ) * refactor: add cluster namespace, remove extra includes	2024-04-22 21:45:43 +03:00
Roman Gershman	2ff7ff9841	chore: get rid of lock keys (#2894 ) * chore: get rid of lock keys 1. Introduce LockTag a type representing the part of the key that is used for locking. 2. Hash keys once in each transaction. 3. Expose swap_memory_bytes metric. --------- Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-04-21 11:34:42 +03:00
Roman Gershman	8030ee96b5	chore: preparation step for lock fingerprints (#2899 ) The main change here is introduction of the strong type LockTag that differentiates from a string_view key. Also, some testing improvements to improve the footprint of the next PR. Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-04-16 19:23:50 +03:00
Vladislav	4fe00a071e	chore(tiering): Update Get, Set, Del (#2897 ) * chore(tiering): Update Get, Set and Del --------- Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>	2024-04-16 19:20:24 +03:00
Roman Gershman	da5c51d1dd	chore: LockTable tracks fingerprints of keys (#2839 ) * chore: LockTable tracks fingerprints of keys It's a first step that will probably simplify dependencies in many places where we need to keep key strings for that. A second step will be to reduce the CPU load of multi-key operations like MSET and precompute Fingerprints once. --------- Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-04-10 17:52:53 +03:00
Shahar Mike	54c9633cb8	feat(dbslice): Add self-laundering iterator in `DbSlice` (#2815 ) A self-laundering iterator will enable us to, eventually, yield from fibers while holding an iterator. For example: ```cpp auto it1 = db_slice.Find(...); Yield(); // Until now - this could have invalidated `it1` auto it2 = db_slice.Find(...); ``` Why is this a good idea? Because it will enable yielding inside PreUpdate() which will allow breaking down of writing huge entries in small quantities to disk/network, eliminating the need to allocate huge chunks of memory just for serialization. Also, it'll probably unlock future developments as well, as yielding can be useful in other contexts.	2024-04-09 12:00:52 +03:00
Borys	84d451fbed	fix: #2745 don't start migration process again after apply the same the same config is applied (#2822 ) * fix: #2745 don't start a migration process again after the same config is applied refactor: remove extra includes	2024-04-03 10:21:27 +03:00
Shahar Mike	1d04683c48	fix(cluster): Don't miss updates in FLUSHSLOTS (#2783 ) * fix(flushslots): Don't miss updates in `FLUSHSLOTS` This PR registers for PreUpdate() from inside the `FLUSHSLOTS` fiber so that any attempt to update a to-be-deleted key will work as expected (first delete, then apply the change). This fixes several issues: * Any attempt to touch bucket B (like insert a key), where another key in B should be removed, caused us to _not_ remove the latter key * Commands which use an existing value but not completely override then, like `APPEND` and `LPUSH` did not treat the key as removed but instead used the original value Fixes #2771 * fix flushslots syntax in test * EXPECT_EQ(key:0, xxxx) * dbsize	2024-03-31 15:47:38 +03:00
Roman Gershman	9e23f85e6b	chore: expose SBF via compact_object (#2797 ) * chore: expose SBF via compact_object --------- Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-03-30 22:35:22 +03:00
Kostas Kyrimis	370f334baf	chore: remove duplicate code from dash and simplify (#2765 ) * rename all Policy members for consistency * remove duplicate code	2024-03-29 11:14:58 +02:00
Vladislav	c8724adddf	chore: Fix memcached flags not updated (#2787 ) Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>	2024-03-29 12:10:58 +03:00
Kostas Kyrimis	4025b4a6af	fix: fiber preempts on read path and OnCbFinish() clears fetched_items_ (#2763 ) * cache fetched_items_ before preemption such that OnCbFinish does not affect it	2024-03-26 16:38:47 +02:00
adiholden	2ad7439128	feat(server): support cluster replication (#2748 ) * feat(server): support cluster replication Signed-off-by: adi_holden <adi@dragonflydb.io>	2024-03-26 15:26:19 +02:00
Roman Gershman	954780edd1	Remove check-fail in ExpireIfNeeded and introduce DFLY LOAD (#2699 ) * chore: prevent crashing upon inconsistent expiry table Also, introduce "DFLY LOAD <filename>" command in addition to "DEBUG LOAD" as an official command to load snapshots into the running server. --------- Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-03-12 16:54:13 +02:00
Roman Gershman	b38024ba4f	chore: add malloc-based stats and decommit (#2692 ) * chore: add malloc-based stats and decommit Provides more stats and control with glibc-malloc based allocator. For example, with v1.15.0 (--proactor_threads=2), empty database, `info memory`returns ``` used_memory:614576 used_memory_human:600.2KiB used_memory_peak:614576 used_memory_peak_human:600.2KiB used_memory_rss:19922944 used_memory_rss_human:19.00MiB ``` then during `memtier_benchmark -n 300000 --key-maximum 100000 --ratio 0:1 --threads=30 -c 100` (i.e GET-only with 3k connections): ``` used_memory:614576 used_memory_human:600.2KiB used_memory_peak:614576 used_memory_peak_human:600.2KiB used_memory_rss:59985920 used_memory_rss_human:57.21MiB used_memory_peak_rss:59985920 ``` connections overhead grows by ~39MB. when the traffic stops, `used_memory_rss_human` becomes `30.35MiB` and we do not know where 11MB gets lost and `MEMORY DECOMMIT` does not reduce the RSS. With this change, `memory malloc-stats` return during the memtier traffic ``` malloc arena: 394862592 malloc fordblks: 94192 ``` i.e. 395MB virtual memory was allocated by malloc and only 94KB is chunks available for reuse. 395MB is arena virtual memory, and not RSS obviously, but at least we have some visibility into malloc reservations. The RSS usage is the same ~57MB and the difference between virtual and RSS is due to the fact we reserve fiber stacks of size 131KB but we touch less. After the traffic stops, `arena` is reduced to 134520832 bytes, and fordblks are 133016592, i.e. majority of reserved ranges are also free (available to reuse) in the malloc pools. RSS goes down similarly to before to ~31MB. So far, this PR only demonstrated the increased visibility to mmapped ranges reserved by glibc malloc. The additional functional change is in `MEMORY DECOMMIT` that now trims malloc RSS usage from reserved but unused (fordblks) pages by calling `malloc_trim`. After the call, RSS is: `used_memory_rss_human:20.29MiB` which is almost the same as when we started the empty process. Signed-off-by: Roman Gershman <roman@dragonflydb.io> * chore: fix build for older glibc environments Disable these extensions for alpine and use legacy version for older glibc libraries. --------- Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-03-06 13:11:44 +00:00
Shahar Mike	35b0ab101e	fix(flushall): Decommit memory after releasing tables. (#2691 ) In the fiber we used to call `mi_heap_collect()` when we're done deleting items. But since that fiber captures a `vector` of intrusive pointers to `DbTable`s, it can't free all memory used by the tables themselves. A local test shows that this fix helps almost entirely: when occupying a 5gb DB, `FLUSHALL` will reduce RSS by 4.7gb, leaving 300mb still used. A follow up `MEMORY DECOMMIT` will indeed remove these 300mb, but I'm still not sure why they are not released immediately. Still looking... Addresses (1) of #2690	2024-03-05 15:45:13 +02:00
adiholden	7c443f3a15	feat(server): introduce table_growth_margin flag (#2678 ) * feat(server): introduce table_growth_margin flag Signed-off-by: adi_holden <adi@dragonflydb.io>	2024-03-03 13:02:18 +00:00
Borys	8771ab32a6	refactor: create one type for slots set #2459 (#2645 ) * refactor: create one type for slot ranges #2459	2024-02-23 14:10:42 +02:00
Roman Gershman	fa75360227	chore: get rid of object.c and robj* in cc code (#2610 ) Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-02-18 16:52:23 +02:00
adiholden	32e8d49123	feat(tiering): add background offload step (#2504 ) * feat(tiering): add background offload step Signed-off-by: adi_holden <adi@dragonflydb.io	2024-02-14 14:28:41 +02:00
Roman Gershman	4000adf57f	fix: do not migrate during connection close (#2570 ) * fix: do not migrate during connection close Fixes #2569 Before the change we had a corner case where Dragonfly would call OnPreMigrateThread but would not call CancelOnErrorCb because OnBreakCb has already been called (it resets break_cb_engaged_) On the other hand in OnPostMigrateThread we called RegisterOnErrorCb if breaker_cb_ which resulted in double registration. This change simplifies the logic by removing break_cb_engaged_ flag since CancelOnErrorCb is safe to call if nothing is registered. Moreover, we now skip Migrate flow if a socket is being closed. --------- Signed-off-by: Roman Gershman <roman@dragonflydb.io>	2024-02-12 16:03:34 +02:00
adiholden	503891b1fa	fix(server): update post updater iterator in tiering (#2497 ) * fix(server): update post updater iterator in tiering Signed-off-by: adi_holden <adi@dragonflydb.io>	2024-01-30 11:46:00 +00:00
Roman Gershman	d608ec9c62	chore: Introduce LockKey for LockTable (#2463 ) This should reduce allocations in a common case (not multi). In addition, rename Transaction::args_ to kv_args_. Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io> Co-authored-by: Vladislav <vlad@dragonflydb.io>	2024-01-28 12:19:15 +02:00
adiholden	9f4c4353b5	fix(server): mget crash on same key get (#2474 ) * fix(server): mget crash on same key get fix: #2465 the bug: on cache mode mget bumps up items. When executing mget with the same key several times i.e mget key key we will invalidate the iterator when we bump up the item in dash table. the fix: bump up/down items only once by using bumped_items set This PR also reverts `c225113` and updates the bumped stats and bumped_items set if the item was bumped Signed-off-by: adi_holden <adi@dragonflydb.io>	2024-01-28 11:45:35 +02:00

1 2 3 4

170 commits