io — 用于处理流的核心工具

源代码: Lib/io.py


概述

io module provides Python’s main facilities for dealing with various types of I/O. There are three main types of I/O: text I/O , binary I/O and raw I/O . These are generic categories, and various backing stores can be used for each of them. A concrete object belonging to any of these categories is called a 文件对象 . Other common terms are stream and 像文件对象 .

Independent of its category, each concrete stream object will also have various capabilities: it can be read-only, write-only, or read-write. It can also allow arbitrary random access (seeking forwards or backwards to any location), or only sequential access (for example in the case of a socket or pipe).

All streams are careful about the type of data you give to them. For example giving a str object to the write() method of a binary stream will raise a TypeError . So will giving a bytes object to the write() method of a text stream.

3.3 版改变: 操作用于引发 IOError 现在引发 OSError ,由于 IOError 现在是别名化的 OSError .

文本 I/O

Text I/O 期望和产生 str objects. This means that whenever the backing store is natively made of bytes (such as in the case of a file), encoding and decoding of data is made transparently as well as optional translation of platform-specific newline characters.

The easiest way to create a text stream is with open() , optionally specifying an encoding:

f = open("myfile.txt", "r", encoding="utf-8")
									

In-memory text streams are also available as StringIO 对象:

f = io.StringIO("some initial text data")
									

The text stream API is described in detail in the documentation of TextIOBase .

二进制 I/O

二进制 I/O (也称 buffered I/O ) 期望 像字节对象 和产生 bytes objects. No encoding, decoding, or newline translation is performed. This category of streams can be used for all kinds of non-text data, and also when manual control over the handling of text data is desired.

The easiest way to create a binary stream is with open() with 'b' in the mode string:

f = open("myfile.jpg", "rb")
									

In-memory binary streams are also available as BytesIO 对象:

f = io.BytesIO(b"some initial binary data: \x00\x01")
									

The binary stream API is described in detail in the docs of BufferedIOBase .

Other library modules may provide additional ways to create text or binary streams. See socket.socket.makefile() 例如。

原生 I/O

原生 I/O (也称 unbuffered I/O ) is generally used as a low-level building-block for binary and text streams; it is rarely useful to directly manipulate a raw stream from user code. Nevertheless, you can create a raw stream by opening a file in binary mode with buffering disabled:

f = open("myfile.jpg", "rb", buffering=0)
									

The raw stream API is described in detail in the docs of RawIOBase .

高级模块接口

io. DEFAULT_BUFFER_SIZE

An int containing the default buffer size used by the module’s buffered I/O classes. open() uses the file’s blksize (as obtained by os.stat() ) 若可能的话。

io. open ( file , mode='r' , buffering=-1 , encoding=None , errors=None , newline=None , closefd=True , opener=None )

This is an alias for the builtin open() 函数。

此函数引发 审计事件 open 采用自变量 path , mode and flags mode and flags arguments may have been modified or inferred from the original call.

io. open_code ( path )

Opens the provided file with mode 'rb' . This function should be used when the intent is to treat the contents as executable code.

path 应该为 str and an absolute path.

The behavior of this function may be overridden by an earlier call to the PyFile_SetOpenCodeHook() . However, assuming that path str 和绝对路径, open_code(path) should always behave the same as open(path, 'rb') . Overriding the behavior is intended for additional validation or preprocessing of the file.

3.8 版新增。

exception io. BlockingIOError

This is a compatibility alias for the builtin BlockingIOError 异常。

exception io. UnsupportedOperation

异常继承 OSError and ValueError that is raised when an unsupported operation is called on a stream.

另请参阅

sys

包含标准 IO 流: sys.stdin , sys.stdout ,和 sys.stderr .

类层次结构

The implementation of I/O streams is organized as a hierarchy of classes. First 抽象基类 (ABCs), which are used to specify the various categories of streams, then concrete classes providing the standard stream implementations.

注意

The abstract base classes also provide default implementations of some methods in order to help implementation of concrete stream classes. For example, BufferedIOBase provides unoptimized implementations of readinto() and readline() .

At the top of the I/O hierarchy is the abstract base class IOBase . It defines the basic interface to a stream. Note, however, that there is no separation between reading and writing to streams; implementations are allowed to raise UnsupportedOperation 若它们不支持给定操作。

RawIOBase ABC (抽象基类) 扩展 IOBase . It deals with the reading and writing of bytes to a stream. FileIO 子类 RawIOBase to provide an interface to files in the machine’s file system.

BufferedIOBase ABC (抽象基类) 扩展 IOBase . It deals with buffering on a raw binary stream ( RawIOBase )。其子类, BufferedWriter , BufferedReader ,和 BufferedRWPair buffer raw binary streams that are readable, writable, and both readable and writable, respectively. BufferedRandom provides a buffered interface to seekable streams. Another BufferedIOBase 子类, BytesIO , is a stream of in-memory bytes.

TextIOBase ABC (抽象基类) 扩展 IOBase . It deals with streams whose bytes represent text, and handles encoding and decoding to and from strings. TextIOWrapper , which extends TextIOBase , is a buffered text interface to a buffered raw stream ( BufferedIOBase ). Finally, StringIO is an in-memory stream for text.

Argument names are not part of the specification, and only the arguments of open() are intended to be used as keyword arguments.

下表汇总的 ABC (抽象基类) 提供通过 io 模块:

ABC

继承

Stub 方法

混合方法和特性

IOBase

fileno , seek ,和 truncate

close , closed , __enter__ , __exit__ , flush , isatty , __iter__ , __next__ , readable , readline , readlines , seekable , tell , writable ,和 writelines

RawIOBase

IOBase

readinto and write

继承 IOBase 方法, read ,和 readall

BufferedIOBase

IOBase

detach , read , read1 ,和 write

继承 IOBase 方法, readinto ,和 readinto1

TextIOBase

IOBase

detach , read , readline ,和 write

继承 IOBase 方法, encoding , errors ,和 newlines

I/O 基类

class io. IOBase

所有 I/O 类的抽象基类,作用于字节流。没有公共构造函数。

This class provides empty abstract implementations for many methods that derived classes can override selectively; the default implementations represent a file that cannot be read, written or seeked.

Even though IOBase does not declare read() or write() because their signatures will vary, implementations and clients should consider those methods part of the interface. Also, implementations may raise a ValueError (或 UnsupportedOperation ) when operations they do not support are called.

The basic type used for binary data read from or written to a file is bytes 。其它 像字节对象 are accepted as method arguments too. Text I/O classes work with str 数据。

Note that calling any method (even inquiries) on a closed stream is undefined. Implementations may raise ValueError 在这种情况下。

IOBase (and its subclasses) supports the iterator protocol, meaning that an IOBase object can be iterated over yielding the lines in a stream. Lines are defined slightly differently depending on whether the stream is a binary stream (yielding bytes), or a text stream (yielding character strings). See readline() below.

IOBase 也是上下文管理器,因此支持 with 语句。在此范例中, file 被关闭后于 with 语句的套件完成 — 即使发生异常:

with open('spam.txt', 'w') as file:
    file.write('Spam and eggs!')
											

IOBase 提供这些数据属性和方法:

close ( )

Flush and close this stream. This method has no effect if the file is already closed. Once the file is closed, any operation on the file (e.g. reading or writing) will raise a ValueError .

As a convenience, it is allowed to call this method more than once; only the first call, however, will have an effect.

closed

True 若流被关闭。

fileno ( )

返回流的底层文件描述符 (整数) 若存在。 OSError 被引发若 IO 对象不使用文件描述符。

flush ( )

刷新流的写入缓冲若适用。这什么都不做,对于只读和非阻塞流。

isatty ( )

返回 True 若流可交互 (即:连接到终端/tty 设备)。

readable ( )

返回 True 若流可以被读取。若 False , read() 会引发 OSError .

readline ( size=-1 )

从流读取并返回一行。若 size 有指定,最多 size 字节将被读取。

行终止符始终是 b'\n' 对于二进制文件;对于文本文件, newline 自变量为 open() 可以是用于识别行终止符的选择。

readlines ( hint=-1 )

从流读取并返回行列表。 hint 可以指定要控制的读取行数:没有更多行将被读取,若到目前为止的所有行总大小 (以字节/字符为单位) 超过 hint .

hint 值为 0 或小于,及 None ,视为没有提示。

注意,迭代文件对象已经是可能的使用 for line in file: ... 不调用 file.readlines() .

seek ( offset , whence=SEEK_SET )

将流位置更改为给定字节 offset . offset 的解释相对位置指示通过 whence 。默认值对于 whence is SEEK_SET 。值对于 whence 是:

  • SEEK_SET or 0 – 流的开头 (默认); offset 应该为 0 或正值

  • SEEK_CUR or 1 – 当前流位置; offset 可能为负值

  • SEEK_END or 2 – 流的结尾; offset 通常为负值

返回新的绝对位置。

3.1 版新增: SEEK_* 常量。

3.3 版新增: 某些操作系统可以支持额外值,像 os.SEEK_HOLE or os.SEEK_DATA 。文件有效值从属它是以文本模式打开,还是以二进制模式打开。

seekable ( )

返回 True 若流支持随机访问。若 False , seek() , tell() and truncate() 会引发 OSError .

tell ( )

返回当前流的位置。

truncate ( size=None )

Resize the stream to the given size in bytes (or the current position if size is not specified). The current stream position isn’t changed. This resizing can extend or reduce the current file size. In case of extension, the contents of the new file area depend on the platform (on most systems, additional bytes are zero-filled). The new file size is returned.

3.5 版改变: Windows 现在以 0 填充文件当扩展时。

writable ( )

返回 True 若流支持写入。若 False , write() and truncate() 会引发 OSError .

writelines ( lines )

Write a list of lines to the stream. Line separators are not added, so it is usual for each of the lines provided to have a line separator at the end.

__del__ ( )

准备销毁对象。 IOBase provides a default implementation of this method that calls the instance’s close() 方法。

class io. RawIOBase

用于原生二进制流的基类。它继承 IOBase 。没有公共构造函数。

Raw binary streams typically provide low-level access to an underlying OS device or API, and do not try to encapsulate it in high-level primitives (this functionality is done at a higher-level in buffered binary streams and text streams, described later in this page).

RawIOBase provides these methods in addition to those from IOBase :

read ( size=-1 )

读取直到 size bytes from the object and return them. As a convenience, if size is unspecified or -1, all bytes until EOF are returned. Otherwise, only one system call is ever made. Fewer than size bytes may be returned if the operating system call returns fewer than size 字节。

若返回 0 字节,和 size was not 0, this indicates end of file. If the object is in non-blocking mode and no bytes are available, None 被返回。

默认实现延期到 readall() and readinto() .

readall ( )

Read and return all the bytes from the stream until EOF, using multiple calls to the stream if necessary.

readinto ( b )

将字节读入预分配,可写 像字节对象 b , and return the number of bytes read. For example, b might be a bytearray . If the object is in non-blocking mode and no bytes are available, None 被返回。

write ( b )

写入给定 像字节对象 , b , to the underlying raw stream, and return the number of bytes written. This can be less than the length of b in bytes, depending on specifics of the underlying raw stream, and especially if it is in non-blocking mode. None is returned if the raw stream is set not to block and no single byte could be readily written to it. The caller may release or mutate b after this method returns, so the implementation should only access b 在方法调用期间。

class io. BufferedIOBase

Base class for binary streams that support some kind of buffering. It inherits IOBase 。没有公共构造函数。

The main difference with RawIOBase is that methods read() , readinto() and write() will try (respectively) to read as much input as requested or to consume all given output, at the expense of making perhaps more than one system call.

In addition, those methods can raise BlockingIOError if the underlying raw stream is in non-blocking mode and cannot take or give enough data; unlike their RawIOBase counterparts, they will never return None .

Besides, the read() method does not have a default implementation that defers to readinto() .

典型 BufferedIOBase implementation should not inherit from a RawIOBase implementation, but wrap one, like BufferedWriter and BufferedReader do.

BufferedIOBase provides or overrides these data attributes and methods in addition to those from IOBase :

raw

底层原生流 ( RawIOBase 实例) BufferedIOBase deals with. This is not part of the BufferedIOBase API and may not exist on some implementations.

detach ( )

Separate the underlying raw stream from the buffer and return it.

After the raw stream has been detached, the buffer is in an unusable state.

Some buffers, like BytesIO , do not have the concept of a single raw stream to return from this method. They raise UnsupportedOperation .

3.1 版新增。

read ( size=-1 )

读取并返回直到 size bytes. If the argument is omitted, None , or negative, data is read and returned until EOF is reached. An empty bytes object is returned if the stream is already at EOF.

If the argument is positive, and the underlying raw stream is not interactive, multiple raw reads may be issued to satisfy the byte count (unless EOF is reached first). But for interactive raw streams, at most one raw read will be issued, and a short result does not imply that EOF is imminent.

BlockingIOError is raised if the underlying raw stream is in non blocking-mode, and has no data available at the moment.

read1 ( [ size ] )

读取并返回直到 size bytes, with at most one call to the underlying raw stream’s read() (或 readinto() ) method. This can be useful if you are implementing your own buffering on top of a BufferedIOBase 对象。

size is -1 (the default), an arbitrary number of bytes are returned (more than zero unless EOF is reached).

readinto ( b )

将字节读入预分配,可写 像字节对象 b and return the number of bytes read. For example, b might be a bytearray .

read() , multiple reads may be issued to the underlying raw stream, unless the latter is interactive.

BlockingIOError is raised if the underlying raw stream is in non blocking-mode, and has no data available at the moment.

readinto1 ( b )

将字节读入预分配,可写 像字节对象 b , using at most one call to the underlying raw stream’s read() (或 readinto() ) method. Return the number of bytes read.

BlockingIOError is raised if the underlying raw stream is in non blocking-mode, and has no data available at the moment.

3.5 版新增。

write ( b )

写入给定 像字节对象 , b , and return the number of bytes written (always equal to the length of b in bytes, since if the write fails an OSError will be raised). Depending on the actual implementation, these bytes may be readily written to the underlying stream, or held in a buffer for performance and latency reasons.

当在非阻塞模式下时, BlockingIOError is raised if the data needed to be written to the raw stream but it couldn’t accept all the data without blocking.

The caller may release or mutate b after this method returns, so the implementation should only access b 在方法调用期间。

原生文件 I/O

class io. FileIO ( name , mode='r' , closefd=True , opener=None )

A raw binary stream representing an OS-level file containing bytes data. It inherits RawIOBase .

name can be one of two things:

  • 字符串或 bytes object representing the path to the file which will be opened. In this case closefd must be True (the default) otherwise an error will be raised.

  • an integer representing the number of an existing OS-level file descriptor to which the resulting FileIO object will give access. When the FileIO object is closed this fd will be closed as well, unless closefd 被设为 False .

mode 可以是 'r' , 'w' , 'x' or 'a' for reading (default), writing, exclusive creation or appending. The file will be created if it doesn’t exist when opened for writing or appending; it will be truncated when opened for writing. FileExistsError will be raised if it already exists when opened for creating. Opening a file for creating implies writing, so this mode behaves in a similar way to 'w' . Add a '+' to the mode to allow simultaneous reading and writing.

read() (when called with a positive argument), readinto() and write() methods on this class will only make one system call.

A custom opener can be used by passing a callable as opener . The underlying file descriptor for the file object is then obtained by calling opener with ( name , flags ). opener must return an open file descriptor (passing os.open as opener results in functionality similar to passing None ).

新近创建的文件 不可继承 .

open() built-in function for examples on using the opener 参数。

3.3 版改变: opener 参数被添加。 'x' 模式被添加。

3.4 版改变: 文件现在不可继承。

FileIO provides these data attributes in addition to those from RawIOBase and IOBase :

mode

The mode as given in the constructor.

name

The file name. This is the file descriptor of the file when no name is given in the constructor.

缓冲流

缓冲 I/O 流为 I/O 设备提供更高级接口,相比原生 I/O。

class io. BytesIO ( [ initial_bytes ] )

A binary stream using an in-memory bytes buffer. It inherits BufferedIOBase . The buffer is discarded when the close() 方法被调用。

可选自变量 initial_bytes 像字节对象 that contains initial data.

BytesIO provides or overrides these methods in addition to those from BufferedIOBase and IOBase :

getbuffer ( )

Return a readable and writable view over the contents of the buffer without copying them. Also, mutating the view will transparently update the contents of the buffer:

>>> b = io.BytesIO(b"abcdef")
>>> view = b.getbuffer()
>>> view[2:4] = b"56"
>>> b.getvalue()
b'ab56ef'
													

注意

只要视图存在, BytesIO 对象不可以重置大小或关闭。

3.2 版新增。

getvalue ( )

返回 bytes 包含缓冲的整个内容。

read1 ( [ size ] )

BytesIO ,这如同 read() .

3.7 版改变: size 自变量现在是可选的。

readinto1 ( b )

BytesIO ,这如同 readinto() .

3.5 版新增。

class io. BufferedReader ( raw , buffer_size=DEFAULT_BUFFER_SIZE )

A buffered binary stream providing higher-level access to a readable, non seekable RawIOBase raw binary stream. It inherits BufferedIOBase .

When reading data from this object, a larger amount of data may be requested from the underlying raw stream, and kept in an internal buffer. The buffered data can then be returned directly on subsequent reads.

构造函数创建 BufferedReader for the given readable raw stream and buffer_size 。若 buffer_size is omitted, DEFAULT_BUFFER_SIZE 被使用。

BufferedReader provides or overrides these methods in addition to those from BufferedIOBase and IOBase :

peek ( [ size ] )

Return bytes from the stream without advancing the position. At most one single read on the raw stream is done to satisfy the call. The number of bytes returned may be less or more than requested.

read ( [ size ] )

Read and return size bytes, or if size is not given or negative, until EOF or if the read call would block in non-blocking mode.

read1 ( [ size ] )

读取并返回直到 size bytes with only one call on the raw stream. If at least one byte is buffered, only buffered bytes are returned. Otherwise, one raw stream read call is made.

3.7 版改变: size 自变量现在是可选的。

class io. BufferedWriter ( raw , buffer_size=DEFAULT_BUFFER_SIZE )

A buffered binary stream providing higher-level access to a writeable, non seekable RawIOBase raw binary stream. It inherits BufferedIOBase .

When writing to this object, data is normally placed into an internal buffer. The buffer will be written out to the underlying RawIOBase object under various conditions, including:

构造函数创建 BufferedWriter for the given writeable raw stream. If the buffer_size 不给定,默认为 DEFAULT_BUFFER_SIZE .

BufferedWriter provides or overrides these methods in addition to those from BufferedIOBase and IOBase :

flush ( )

Force bytes held in the buffer into the raw stream. A BlockingIOError should be raised if the raw stream blocks.

write ( b )

写入 像字节对象 , b , and return the number of bytes written. When in non-blocking mode, a BlockingIOError is raised if the buffer needs to be written out but the raw stream blocks.

class io. BufferedRandom ( raw , buffer_size=DEFAULT_BUFFER_SIZE )

A buffered binary stream providing higher-level access to a seekable RawIOBase raw binary stream. It inherits BufferedReader and BufferedWriter .

The constructor creates a reader and writer for a seekable raw stream, given in the first argument. If the buffer_size is omitted it defaults to DEFAULT_BUFFER_SIZE .

BufferedRandom is capable of anything BufferedReader or BufferedWriter can do. In addition, seek() and tell() are guaranteed to be implemented.

class io. BufferedRWPair ( reader , writer , buffer_size=DEFAULT_BUFFER_SIZE )

A buffered binary stream providing higher-level access to two non seekable RawIOBase raw binary streams—one readable, the other writeable. It inherits BufferedIOBase .

reader and writer are RawIOBase objects that are readable and writeable respectively. If the buffer_size is omitted it defaults to DEFAULT_BUFFER_SIZE .

BufferedRWPair implements all of BufferedIOBase ’s methods except for detach() , which raises UnsupportedOperation .

警告

BufferedRWPair does not attempt to synchronize accesses to its underlying raw streams. You should not pass it the same object as reader and writer; use BufferedRandom 代替。

文本 I/O

class io. TextIOBase

Base class for text streams. This class provides a character and line based interface to stream I/O. It inherits IOBase 。没有公共构造函数。

TextIOBase provides or overrides these data attributes and methods in addition to those from IOBase :

encoding

The name of the encoding used to decode the stream’s bytes into strings, and to encode strings into bytes.

errors

The error setting of the decoder or encoder.

newlines

A string, a tuple of strings, or None , indicating the newlines translated so far. Depending on the implementation and the initial constructor flags, this may not be available.

buffer

The underlying binary buffer (a BufferedIOBase 实例) TextIOBase deals with. This is not part of the TextIOBase API and may not exist in some implementations.

detach ( )

Separate the underlying binary buffer from the TextIOBase and return it.

After the underlying buffer has been detached, the TextIOBase is in an unusable state.

Some TextIOBase 实现,像 StringIO , may not have the concept of an underlying buffer and calling this method will raise UnsupportedOperation .

3.1 版新增。

read ( size=-1 )

读取并返回最多 size characters from the stream as a single str 。若 size is negative or None , reads until EOF.

readline ( size=-1 )

Read until newline or EOF and return a single str . If the stream is already at EOF, an empty string is returned.

size 有指定,最多 size 字符将被读取。

seek ( offset , whence=SEEK_SET )

Change the stream position to the given offset . Behaviour depends on the whence parameter. The default value for whence is SEEK_SET .

  • SEEK_SET or 0 : seek from the start of the stream (the default); offset must either be a number returned by TextIOBase.tell() , or zero. Any other offset value produces undefined behaviour.

  • SEEK_CUR or 1 : “seek” to the current position; offset must be zero, which is a no-operation (all other values are unsupported).

  • SEEK_END or 2 : seek to the end of the stream; offset must be zero (all other values are unsupported).

以不透明数字形式返回新的绝对位置。

3.1 版新增: SEEK_* 常量。

tell ( )

Return the current stream position as an opaque number. The number does not usually represent a number of bytes in the underlying binary storage.

write ( s )

Write the string s to the stream and return the number of characters written.

class io. TextIOWrapper ( buffer , encoding=None , errors=None , newline=None , line_buffering=False , write_through=False )

A buffered text stream providing higher-level access to a BufferedIOBase buffered binary stream. It inherits TextIOBase .

encoding gives the name of the encoding that the stream will be decoded or encoded with. It defaults to locale.getpreferredencoding(False) .

errors is an optional string that specifies how encoding and decoding errors are to be handled. Pass 'strict' to raise a ValueError exception if there is an encoding error (the default of None has the same effect), or pass 'ignore' to ignore errors. (Note that ignoring encoding errors can lead to data loss.) 'replace' causes a replacement marker (such as '?' ) to be inserted where there is malformed data. 'backslashreplace' causes malformed data to be replaced by a backslashed escape sequence. When writing, 'xmlcharrefreplace' (replace with the appropriate XML character reference) or 'namereplace' (replace with \N{...} escape sequences) can be used. Any other error handling name that has been registered with codecs.register_error() is also valid.

newline controls how line endings are handled. It can be None , '' , '\n' , '\r' ,和 '\r\n' . It works as follows:

  • When reading input from the stream, if newline is None , 通用换行符 mode is enabled. Lines in the input can end in '\n' , '\r' ,或 '\r\n' , and these are translated into '\n' before being returned to the caller. If newline is '' , universal newlines mode is enabled, but line endings are returned to the caller untranslated. If newline has any of the other legal values, input lines are only terminated by the given string, and the line ending is returned to the caller untranslated.

  • When writing output to the stream, if newline is None , any '\n' characters written are translated to the system default line separator, os.linesep 。若 newline is '' or '\n' , no translation takes place. If newline is any of the other legal values, any '\n' characters written are translated to the given string.

line_buffering is True , flush() is implied when a call to write contains a newline character or a carriage return.

write_through is True , calls to write() are guaranteed not to be buffered: any data written on the TextIOWrapper object is immediately handled to its underlying binary buffer .

3.3 版改变: write_through argument has been added.

3.3 版改变: 默认 encoding 现为 locale.getpreferredencoding(False) 而不是 locale.getpreferredencoding() . Don’t change temporary the locale encoding using locale.setlocale() , use the current locale encoding instead of the user preferred encoding.

TextIOWrapper provides these data attributes and methods in addition to those from TextIOBase and IOBase :

line_buffering

行缓冲是否被启用。

write_through

Whether writes are passed immediately to the underlying binary buffer.

3.7 版新增。

reconfigure ( *[, encoding][, errors][, newline][, line_buffering][, write_through] )

Reconfigure this text stream using new settings for encoding , errors , newline , line_buffering and write_through .

Parameters not specified keep current settings, except errors='strict' is used when encoding is specified but errors is not specified.

It is not possible to change the encoding or newline if some data has already been read from the stream. On the other hand, changing encoding after write is possible.

This method does an implicit stream flush before setting the new parameters.

3.7 版新增。

class io. StringIO ( initial_value='' , newline='\n' )

A text stream using an in-memory text buffer. It inherits TextIOBase .

The text buffer is discarded when the close() 方法被调用。

The initial value of the buffer can be set by providing initial_value . If newline translation is enabled, newlines will be encoded as if by write() . The stream is positioned at the start of the buffer.

newline argument works like that of TextIOWrapper , except that when writing output to the stream, if newline is None , newlines are written as \n on all platforms.

StringIO provides this method in addition to those from TextIOBase and IOBase :

getvalue ( )

返回 str containing the entire contents of the buffer. Newlines are decoded as if by read() , although the stream position is not changed.

用法范例:

import io
output = io.StringIO()
output.write('First line.\n')
print('Second line.', file=output)
# Retrieve file contents -- this will be
# 'First line.\nSecond line.\n'
contents = output.getvalue()
# Close object and discard memory buffer --
# .getvalue() will now raise an exception.
output.close()
											
class io. IncrementalNewlineDecoder

A helper codec that decodes newlines for 通用换行符 模式。它继承 codecs.IncrementalDecoder .

性能

This section discusses the performance of the provided concrete I/O implementations.

二进制 I/O

By reading and writing only large chunks of data even when the user asks for a single byte, buffered I/O hides any inefficiency in calling and executing the operating system’s unbuffered I/O routines. The gain depends on the OS and the kind of I/O which is performed. For example, on some modern OSes such as Linux, unbuffered disk I/O can be as fast as buffered I/O. The bottom line, however, is that buffered I/O offers predictable performance regardless of the platform and the backing device. Therefore, it is almost always preferable to use buffered I/O rather than unbuffered I/O for binary data.

文本 I/O

Text I/O over a binary storage (such as a file) is significantly slower than binary I/O over the same storage, because it requires conversions between unicode and binary data using a character codec. This can become noticeable handling huge amounts of text data like large log files. Also, TextIOWrapper.tell() and TextIOWrapper.seek() are both quite slow due to the reconstruction algorithm used.

StringIO , however, is a native in-memory unicode container and will exhibit similar speed to BytesIO .

多线程

FileIO objects are thread-safe to the extent that the operating system calls (such as read(2) under Unix) they wrap are thread-safe too.

Binary buffered objects (instances of BufferedReader , BufferedWriter , BufferedRandom and BufferedRWPair ) protect their internal structures using a lock; it is therefore safe to call them from multiple threads at once.

TextIOWrapper 对象不是线程安全的。

重入

Binary buffered objects (instances of BufferedReader , BufferedWriter , BufferedRandom and BufferedRWPair ) are not reentrant. While reentrant calls will not happen in normal situations, they can arise from doing I/O in a signal handler. If a thread tries to re-enter a buffered object which it is already accessing, a RuntimeError is raised. Note this doesn’t prohibit a different thread from entering the buffered object.

The above implicitly extends to text files, since the open() function will wrap a buffered object inside a TextIOWrapper . This includes standard streams and therefore affects the built-in print() function as well.