バージョン 1 から バージョン 2 における更新: OwlimSe

差分発生行の前後
無視リスト:
更新日時:
2012/03/07 15:17:53 (12 年 前)
更新者:
wu
コメント:

--

凡例:

変更なし
追加
削除
変更
  • OwlimSe

    v1 v2  
    1717 
    1818{{{ 
    19 NumberOfBuffers          = 6500000 
    20 MaxDirtyBuffers          = 5000000 
    21 AsyncQueryMaxThreads     = 18 
    22 ThreadsPerQuery          = 18 
    23 IndexTreeMaps            = 512 
    24 ThreadCleanupInterval    = 1 
    25 ResourcesCleanupInterval = 1 
     19JVMSetting: 
     20 -Xmx55G -Xms30G -XX:+UseG1GC -XX:+TieredCompilation 
     21-Druleset=empty -Dentity-index-size=1147483647 -Dcache-memory=16645m -Dtuple-index-memory=15G -DenablePredicateList=false  -DftsIndexPolicy=never  -Dbuild-pcsot=false -Dbuild-ptsoc=false  -Djournaling=true -Drepository-type=file-repository  -Dentity-id-size=32   
    2622}}} 
    2723 
    28  
    29 More information please refer to [http://docs.openlinksw.com/virtuoso/databaseadmsrv.html] [http://www.openlinksw.com/weblog/oerling/?id=1665] 
     24More information please refer to [http://docs.openlinksw.com/virtuoso/databaseadmsrv.html]  
    3025 
    3126 
    3227=== Load Performance === #load 
    3328 
    34 46mins22secs 
     29'''Approach 1:''' 
     30 
     31'''Approach 2:''' 
     32 
     33The idea is from uniprot, which uses owlim as an library as follows: 
     34 
     35Basically They have one specific loader program, where there is one java thread that reads the triples into a blocking queue. Then a different number of threads take triples from that queue and insert the data into OWLIM-se (or any other sesame API compatible triplestore). Normally one inserting thread per owlim file-repository fragment. The inserter treads use transactions that commit every half a million statements. The basic is to add statements not files. 
     36 
     37    final org.openrdf.model.Statement sesameStatement = getSesameStatement(object); 
     38 
     39    //Takes one from the blocking queue filled by the other thread 
     40 
     41        connection.add(sesameStatement, graph); 
     42 
     43and every millionth statement , do connection.commit(); 
    3544 
    3645