✔ When to use IOBuffer? · helpdesk (published)

Reading from io = open(fname) is much slower than reading from IOBuffer(read(io)). I understand that this is because all the bytes are copied to RAM.

If we can guarantee that all bytes in io fit in RAM, then it is always better to use IOBuffer for speed?

Sukera (Nov 22 2025 at 12:04):

How are you measuring? If you exclude the time it takes to reads the data into RAM for the IOBuffer from your measurement, of course it's faster. Only the processing time is left after all. However, your overall application likely is not magically fast, because the data still has to be read into RAM.

Júlio Hoffimann (Nov 22 2025 at 12:05):

I am including the time it takes to create the IOBuffer and it is still faster.

Sukera (Nov 22 2025 at 12:41):

Júlio Hoffimann (Nov 22 2025 at 12:53):

Can try to produce one with more time later. Currently trying to debug another issue.

Júlio Hoffimann (Nov 22 2025 at 14:30):

The example could be as simple as reading a matrix of size 2500×25000 stored in the IO:

for j in 1:25000
  for i in 1:2500
    read(io, Float64)
  end
end

Jakob Nybo Andersen (Nov 22 2025 at 17:01):

That is almost certainly because the file object (IOStream) has some overhead, mostly from taking a lock associated with the file. You can match the performance of the IOBuffer by buffering the file object in Julia. That is, you make a small buffer, e.g. a 16 KiB Vector{UInt8}, then read into that

Júlio Hoffimann (Nov 22 2025 at 17:02):

Gunnar Farnebäck (Nov 22 2025 at 17:34):

help?> read!
search: read! read real rpad readdir Threads isready prepend! readeach readline readlink replace!

  read!(stream::IO, array::AbstractArray)
  read!(filename::AbstractString, array::AbstractArray)

  Read binary data from an I/O stream or file, filling in array.

Jakob Nybo Andersen (Nov 22 2025 at 17:49):

For what it's worth, I'm quite unhappy with the API that Base provides, which is why I made the package BufferIO.jl to improve this area of Julia

Jakob Nybo Andersen (Nov 22 2025 at 17:49):

For a more mature, though less efficient alternative, look at BufferedStreams.jl

Júlio Hoffimann (Nov 22 2025 at 17:54):

Notification Bot (Nov 22 2025 at 19:09):

Nathan Zimmerberg (Nov 24 2025 at 20:14):

Júlio Hoffimann (Nov 24 2025 at 20:18):

Nathan Zimmerberg (Nov 24 2025 at 20:28):

If you have the memory it will almost always be nicer to just read everything into memory first and do your your processing on a big Vector{UInt8}. Beyond performance doing this greatly simplifies the logic for error handling, because any IO errors can be handled up front, also, unlike IO, the Vector interface is well documented.

Stream: helpdesk (published)

Topic: ✔ When to use IOBuffer?

Júlio Hoffimann (Nov 22 2025 at 11:07):

Sukera (Nov 22 2025 at 12:04):

Júlio Hoffimann (Nov 22 2025 at 12:05):

Sukera (Nov 22 2025 at 12:41):

Júlio Hoffimann (Nov 22 2025 at 12:53):

Júlio Hoffimann (Nov 22 2025 at 14:30):

Jakob Nybo Andersen (Nov 22 2025 at 17:01):

Júlio Hoffimann (Nov 22 2025 at 17:02):

Gunnar Farnebäck (Nov 22 2025 at 17:34):

Jakob Nybo Andersen (Nov 22 2025 at 17:49):

Jakob Nybo Andersen (Nov 22 2025 at 17:49):

Júlio Hoffimann (Nov 22 2025 at 17:54):

Notification Bot (Nov 22 2025 at 19:09):

Nathan Zimmerberg (Nov 24 2025 at 20:14):

Júlio Hoffimann (Nov 24 2025 at 20:18):

Nathan Zimmerberg (Nov 24 2025 at 20:28):