replace ForkJoinPool with fixed thread pool #4552

vladak · 2024-02-24T09:12:20Z

This change replaces the ForkJoinPool used for distributing addFile() tasks in the 2nd phase of indexer processing with fixed thread pool. In my case:

Oracle OCI VM.Standard.E3.Flex with 16 OCPUs, 128 GB RAM running Solaris 11.4 SRU ~63 and storing OpenGrok data on block volume with 25,000 IOPS / Balanced (VPU/GB:10) / Throughput: 480 MB/s
indexer running with Java 11 with -J=-Xmx48g -J=-server using --renamedHistory 'on' -r dirbased -G -m '256' --threads 48 --historyThreads 48 --historyFileThreads 32 and annotation cache enabled for all projects
the input data consists of bigger projects like Linux kernel, FreeBSD, AOSP, Solaris Userland (prepped) - a mix of various repository types (Git, Subversion, Teamware, Mercurial)

it shaves some 8+ hours from the total indexing time:

before:

2024-02-21 16:48:04.992+0000 INFO t1 Statistics.logIt: Done indexing data of all repositories (took 1 day 21:59:50)
2024-02-21 16:48:05.842+0000 INFO t1 Indexer.writeConfigToFile: Writing configuration to '/opengrok/etc/configuration.xml'
2024-02-21 16:48:06.317+0000 INFO t1 Indexer.writeConfigToFile: Done writing configuration to '/opengrok/etc/configuration.xml'
2024-02-21 16:48:06.317+0000 INFO t1 Statistics.logIt: Indexer finished (took 2 days 06:27:12)

after:

2024-02-24 08:12:22.599+0000 INFO t1 Statistics.logIt: Done indexing data of all repositories (took 1 day 13:08:41)
2024-02-24 08:12:23.127+0000 INFO t1 Indexer.writeConfigToFile: Writing configuration to '/opengrok/etc/configuration.xml'
2024-02-24 08:12:23.565+0000 INFO t1 Indexer.writeConfigToFile: Done writing configuration to '/opengrok/etc/configuration.xml'
2024-02-24 08:12:23.566+0000 INFO t1 Statistics.logIt: Indexer finished (took 1 day 22:00:52)

The distribution of the tasks is not as uniform - more often than not a single project occupies the fixed thread pool (while always using it to its full capacity unlike the ForkJoin), however the sub-optimal scheduling is gone. Also, I don't like the hazy promise of custom ForkJoinPool to be used for parallelStream() processing.

fixes oracle#4549

vladak · 2024-02-28T16:50:39Z

Repeated the measurements using the same setup but with statsd metrics collection enabled. Also, I disabled Tomcat to avoid suggester rebuild interfering with the indexing (in terms of CPU load).

before (2 days 06:08:44):

after (1 day 20:08:07):

Now, there is something about the gaps of CPU usage in the second indexing phase, and this should certainly be investigated, however given it's overwhelmingly better that the previous solution makes this easy win.

vladak · 2024-03-04T09:06:42Z

I ran the index from scratch once again, this time watching the average load with a Python program. When the average load for the last minute dropped below 10, it grabbed jstack of the indexer process and saved it to a file. In that case, there was only one big CPU usage depression in the graph. The corresponding jstack revealed that there are 48 OpenGrok-index-worker threads (the number matches the --threads indexer argument value) in RUNNABLE state, all sharing the same stack:

"OpenGrok-index-worker-9363" #9363 prio=5 os_prio=64 cpu=29296657.62ms elapsed=66415.23s tid=0x000000002aa4d000 nid=0x255d runnable  [0x00007ff30e5a0000]
   java.lang.Thread.State: RUNNABLE
        at java.io.FileInputStream.readBytes([email protected]/Native Method)
        at java.io.FileInputStream.read([email protected]/FileInputStream.java:279)
        at java.lang.ProcessImpl$DeferredCloseInputStream.read([email protected]/ProcessImpl.java:805)
        at java.io.BufferedInputStream.read1([email protected]/BufferedInputStream.java:290)
        at java.io.BufferedInputStream.read([email protected]/BufferedInputStream.java:351)
        - locked <0x00007fff2ff7dad0> (a java.io.BufferedInputStream)
        at java.io.BufferedInputStream.fill([email protected]/BufferedInputStream.java:252)
        at java.io.BufferedInputStream.read1([email protected]/BufferedInputStream.java:292)
        at java.io.BufferedInputStream.read([email protected]/BufferedInputStream.java:351)
        - locked <0x00007fff2ff8d980> (a java.io.BufferedInputStream)
        at com.sun.org.apache.xerces.internal.impl.XMLEntityManager$RewindableInputStream.read([email protected]/XMLEntityManager.java:3064)
        at com.sun.org.apache.xerces.internal.impl.io.UTF8Reader.read([email protected]/UTF8Reader.java:303)
        at com.sun.org.apache.xerces.internal.impl.XMLEntityScanner.load([email protected]/XMLEntityScanner.java:1904)
        at com.sun.org.apache.xerces.internal.impl.XMLEntityScanner.scanContent([email protected]/XMLEntityScanner.java:1032)
        at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next([email protected]/XMLDocumentFragmentScannerImpl.java:2744)
        at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next([email protected]/XMLDocumentScannerImpl.java:605)
        at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument([email protected]/XMLDocumentFragmentScannerImpl.java:534)
        at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse([email protected]/XML11Configuration.java:888)
        at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse([email protected]/XML11Configuration.java:824)
        at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse([email protected]/XMLParser.java:141)
        at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse([email protected]/AbstractSAXParser.java:1216)
        at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse([email protected]/SAXParserImpl.java:635)
        at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl.parse([email protected]/SAXParserImpl.java:324)
        at javax.xml.parsers.SAXParser.parse([email protected]/SAXParser.java:197)
        at org.opengrok.indexer.history.SubversionAnnotationParser.processStream(SubversionAnnotationParser.java:87)
        at org.opengrok.indexer.util.Executor.exec(Executor.java:229)
        at org.opengrok.indexer.history.SubversionRepository.annotate(SubversionRepository.java:408)
        at org.opengrok.indexer.history.HistoryGuru.getAnnotationFromRepository(HistoryGuru.java:292)
        at org.opengrok.indexer.history.HistoryGuru.createAnnotationCache(HistoryGuru.java:1167)
        at org.opengrok.indexer.index.IndexDatabase.createAnnotationCache(IndexDatabase.java:1307)
        at org.opengrok.indexer.index.IndexDatabase.addFile(IndexDatabase.java:1286)
        at org.opengrok.indexer.index.IndexDatabase.lambda$indexParallel$9(IndexDatabase.java:1920)
        at org.opengrok.indexer.index.IndexDatabase$$Lambda$1030/0x00007ff311d90168.call(Unknown Source)
        at java.util.concurrent.FutureTask.run([email protected]/FutureTask.java:264)
        at java.util.concurrent.ThreadPoolExecutor.runWorker([email protected]/ThreadPoolExecutor.java:1128)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run([email protected]/ThreadPoolExecutor.java:628)
        at java.lang.Thread.run([email protected]/Thread.java:834)

Basically, at the point the tasks being processed were all pertaining to a single project that consisted of a Subversion repository. The annotation cache was enabled and such processing is more I/O than CPU bound for Subversion, hence the decrease.

While it would be nicer if the graph was smoother, e.g. by somehow mixing the jobs from various projects more evenly, I don't see it as a blocker for integration, esp. considering that annotation cache is disabled by default.

opengrok-indexer/src/main/java/org/opengrok/indexer/index/IndexDatabase.java

replace ForkJoinPool with Executor

9dc804c

fixes oracle#4549

oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Feb 24, 2024

vladak changed the title ~~replace ForkJoinPool with Executor~~ replace ForkJoinPool with fixed thread pool Feb 24, 2024

vladak requested a review from ginoaugustine March 4, 2024 10:38

ginoaugustine reviewed Mar 4, 2024

View reviewed changes

vladak added 2 commits March 5, 2024 12:35

use computeIfAbsent() instead of merge()

85aca64

compute failureCount using Optional

b59afb0

ginoaugustine approved these changes Mar 5, 2024

View reviewed changes

vladak merged commit ab6c4a3 into oracle:master Mar 5, 2024
9 checks passed

vladak deleted the bye_fork_join branch March 5, 2024 14:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replace ForkJoinPool with fixed thread pool #4552

replace ForkJoinPool with fixed thread pool #4552

vladak commented Feb 24, 2024 •

edited

Loading

vladak commented Feb 28, 2024

vladak commented Mar 4, 2024

replace ForkJoinPool with fixed thread pool #4552

replace ForkJoinPool with fixed thread pool #4552

Conversation

vladak commented Feb 24, 2024 • edited Loading

vladak commented Feb 28, 2024

vladak commented Mar 4, 2024

vladak commented Feb 24, 2024 •

edited

Loading