[Date Prev][Date Next] [Chronological] [Thread] [Top]

Question from a LMDB user

To: openldap-technical@openldap.org
Subject: Question from a LMDB user
From: Tao Chen <generalmilk@gmail.com>
Date: Tue, 3 Nov 2015 15:23:17 -0500
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=to:from:subject:message-id:date:user-agent:mime-version :content-type; bh=X8jp+tOuoIsN0dIg3wytv5M6+3eqnV5Jh+o29GJmrbI=; b=b9mcoESuvK0L++rohChqJ84STw4dAlbUWWxacJsw3ht73I9L4TwNjS1KGxUPur5i8G WVFEtJGky2VV1l4fyaxNGAL3lcxUIK3dkVOTdBNtiBB4f5v9kTg1aN+cD59R59v42a1n w4HluzeB2x2ReyZsCK4argPGDqMg0z4+5d7oYel+dGE0P5GMjWh+/1K7Le+MrEbrXyDe gaHfvz82n3L8fmf/VNiMBHkltxkiQueFLQ2ePxAVenCG+K6764bDRVaWZQ+kdoSXzAw/ c1ODz1y64n3U78eVJEOWf8XhfWrAyefDBkHrKF9ncHyCqP6rx1NJPW4cHNqQiylgWOjW N+gg==
User-agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.3.0

Hi Sir/Madam,

Recently I'm trying to use LMDB to store and randomly acess large amount of features. Each feature blob is 16kB.

Before trying LMDB, I just stack all the features together into one huge binay file, and use seek function in C++ to access each feature. Since the feature size is fixed, I can easily compute the address of each feature in the file.

Then I tried LMDB. The value is the feature as it is. The key is "1", "2", "3", .... Since 16kB is exactly 4 x page_size, adding the key and header, each feature will occupy 5 x page_size, so the db file on disk is about 1.25 times of the previous binary file, this is already a disadvantage for LMDB, but I still hope there can be some efficiency trade-off. I use LDMB++ C++ wrapper to access features.

Next, I compared two approach by accessing the same random 1% features from about 300k features. Before the test, I use vmtouch to evict both files from memory cache. The result is surprising. The one use LMDB is 1.5 times slower than the raw binary file (30s vs 20s).

Is this because the size of feature (exactly 4 pages)? Do I understand the use of LMDB incorrectly?

Thank your for your time!

Best Regards,

Tao Chen

Follow-Ups:
- Re: Question from a LMDB user
  - From: Howard Chu <hyc@symas.com>

Prev by Date: Re: OpenLDAP & SSSD Question
Next by Date: accesslog purge starves kerberos kdc authentications
Index(es):
- Chronological
- Thread