バージョン 5 から バージョン 6 における更新: bigdata

差分発生行の前後
無視リスト:
更新日時:
2012/03/08 18:40:00 (13 年 前)
更新者:
wu
コメント:

--

凡例:

変更なし
追加
削除
変更
  • bigdata

    v5 v6  
    2121The RW store (Read-Write) supports recycling of allocation slots on the backing file. It may be used as a time-bounded version of an immortal database where history is aged off of the database over time. This is a good choice for standalone workloads where updates are continuously arriving and older database states may be released. The RW store is also less sensitive to data skew because it can reuse B+Tree node and leaf revisions within a commit group on large data set loads. Scaling should be better than the WORM for standalone and could reach to 10B+ triples. The default property file is attachment:RWStore.properties. 
    2222 
    23 In the test we modified the following two important parameters: 
    24 {{{ 
    25 com.bigdata.btree.writeRetentionQueue.capacity=500000 
    26 com.bigdata.rdf.sail.BigdataSail.bufferCapacity=1000000 
    27 }}} 
    28  
    2923 
    3024=== Load Performance === #load 
     
    3630Approach 2: 
    3731 
     32 Upload with com.bigdata.rdf.store.DataLoader tools and RW store default parameter. 
     33 
     34And test the situation when adding GC in JVM.  
    3835  
     36{{{ 
     37-Xmx55G -Xms30G -XX:+UseG1GC -XX:+TieredCompilation? -XX:+HeapDumpOnOutOfMemoryError 
     38}}} 
    3939 
     40Approach 3: 
     41 
     42We modified the following two important parameters(In the rest test we use this configure in default): 
     43{{{ 
     44com.bigdata.btree.writeRetentionQueue.capacity=500000 
     45com.bigdata.rdf.sail.BigdataSail.bufferCapacity=1000000 
     46}}} 
     47 
     48Approach 4: Split the file into 12 small files. 
    4049 
    4150=== Allie upload === #allieload  
     
    4453 
    4554Approach 2: 5.89hours  
    46 when Setting JVM GC : 6.75hours 
     55when Setting JVM GC : 6.75hours  
    4756 
    4857Approach 3: 2.61 hours 
    4958 
     59Approach 4: 1.03 hours   
    5060 
    5161=== PDBJ upload === #pdbjload  
    5262 
    5363 
    54 '''Result:''' 
     64'''Result:''' 8.95 hours 
    5565 
    5666=== Uniprot upload === #uniprotload