Access File through multiple threads

后端 未结 10 801
天涯浪人
天涯浪人 2021-01-31 10:52

I want to access a large file (file size may vary from 30 MB to 1 GB) through 10 threads and then process each line in the file and write them to another file through 10 threads

10条回答
  •  遇见更好的自我
    2021-01-31 11:41

    Any sort of IO whether it be disk, network, etc. is generally the bottleneck.

    By using multiple threads you are exacerbating the problem as it is very likely only one thread can have access to the IO resource at one time.

    It would be best to use one thread to read, pass off info to a worker pool of threads, and then writing directly from there. But again if the workers write to the same place there will be bottlenecks as only one can have the lock. Easily fixed by passing the data to a single writer thread.

    In "short":

    Single reader thread writes to BlockingQueue or the like, this gives it a natural ordered sequence.

    Then worker pool threads wait on the queue for data, recording its sequence number.

    Worker threads then write the processed data to another BlockingQueue this time attaching its original sequence number so that

    The writer thread can take the data and write it in sequence.

    This will likely yield the fastest implementation possible.

提交回复
热议问题