http.server
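Source code: Lib/http/server.py
This module defines classes for implementing HTTP servers.
Warning
http.server is not recommended for production. It only implements basic security checks.
Availability: not Emscripten, not WASI.
This module does not work or is not available on the WebAssembly platforms wasm32-emscripten and wasm32-wasi. See WebAssembly platforms for more information.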
One class, HTTPServer, is a socketserver.TCPServer subclass. It creates and listens at the HTTP socket, dispatching the requests to a handler. Code to create and run the server looks like this:
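from http.server import BaseHTTPRequestHandler, HTTPServer

def run(server_class=HTTPServer, handler_class=BaseHTTPRequestHandler):
    # An empty host string binds to all interfaces; 8000 is the port.
    server_address = ('', 8000)
    httpd = server_class(server_address, handler_class)
    httpd.serve_forever()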
class http.server.HTTPServer(server_address, RequestHandlerClass)
This class builds on the TCPServer class by storing the server address as instance variables named server_name and server_port. The server is accessible by the handler, typically through the handler's server instance variable.
class http.server.ThreadingHTTPServer(server_address, RequestHandlerClass)
This class is identical to HTTPServer but uses threads to handle requests by using the ThreadingMixIn. This is useful to handle web browsers pre-opening sockets, on which HTTPServer would wait indefinitely.
Added in version 3.7.
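As a rough sketch of how ThreadingHTTPServer might be used, the snippet below starts a server in a background thread and sends it a single request. PingHandler, start_server and ping are hypothetical names chosen for illustration, and port 0 is used on the assumption that the operating system should pick any free port.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


class PingHandler(BaseHTTPRequestHandler):
    """Hypothetical handler that answers every GET with b"pong"."""

    def do_GET(self):
        body = b"pong"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # keep the example quiet; the default writes to sys.stderr


def start_server():
    # Port 0 lets the OS choose a free port; read it back from server_port.
    server = ThreadingHTTPServer(("127.0.0.1", 0), PingHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def ping(server):
    """Send one GET request and return (status, body)."""
    conn = http.client.HTTPConnection("127.0.0.1", server.server_port, timeout=5)
    try:
        conn.request("GET", "/")
        response = conn.getresponse()
        return response.status, response.read()
    finally:
        conn.close()
```

When the server is no longer needed, server.shutdown() followed by server.server_close() stops the loop and releases the socket.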
The HTTPServer and ThreadingHTTPServer must be given a RequestHandlerClass on instantiation, of which this module provides three different variants:
class http.server.BaseHTTPRequestHandler(request, client_address, server)
This class is used to handle the HTTP requests that arrive at the server. By itself, it cannot respond to any actual HTTP requests; it must be subclassed to handle each request method (e.g. GET or POST). BaseHTTPRequestHandler provides a number of class and instance variables, and methods for use by subclasses.
The handler will parse the request and the headers, then call a method specific to the request type. The method name is constructed from the request. For example, for the request method SPAM, the do_SPAM() method will be called with no arguments. All of the relevant information is stored in instance variables of the handler. Subclasses should not need to override or extend the __init__() method.
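The dispatch rule described above can be illustrated with a small sketch. SpamHandler and send_method are hypothetical names, and SPAM is the made-up request method from the text; a method with no matching do_*() handler is expected to be answered with a 501 error.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer


class SpamHandler(BaseHTTPRequestHandler):
    """Hypothetical handler: only the made-up SPAM method is implemented."""

    def do_SPAM(self):
        # Called with no arguments; request details live on self.
        body = b"spam received"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # silence the default stderr logging for this example


def send_method(method):
    """Send one request with the given method and return the status code."""
    server = HTTPServer(("127.0.0.1", 0), SpamHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    conn = http.client.HTTPConnection("127.0.0.1", server.server_port, timeout=5)
    try:
        conn.request(method, "/")
        response = conn.getresponse()
        response.read()
        return response.status
    finally:
        conn.close()
        server.shutdown()
        server.server_close()
```

Calling send_method("SPAM") reaches do_SPAM(), while a method such as "EGGS" has no handler and should be rejected by the base class.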
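BaseHTTPRequestHandler has the following instance variables:
client_address
Contains a tuple of the form (host, port) referring to the client's address.
server
Contains the server instance.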
close_connection
Boolean that should be set before handle_one_request() returns, indicating if another request may be expected, or if the connection should be shut down.
requestline
Contains the string representation of the HTTP request line. The terminating CRLF is stripped. This attribute should be set by handle_one_request(). If no valid request line was processed, it should be set to the empty string.
command
Contains the command (request type). For example, 'GET'.
path
Contains the request path. If the query component of the URL is present, then path includes the query. Using the terminology of RFC 3986, path here includes hier-part and query.
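Because path may carry the query string, a handler usually has to separate the two itself. The following is one possible approach using urllib.parse; split_request_path is a hypothetical helper, not part of http.server.

```python
from urllib.parse import parse_qs, urlsplit


def split_request_path(path):
    """Split a handler's self.path into (path, query parameters).

    Hypothetical helper: parse_qs maps each key to a list of values.
    """
    parts = urlsplit(path)
    return parts.path, parse_qs(parts.query)
```

Inside a do_GET() method this might be called as split_request_path(self.path).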
request_version
Contains the version string from the request. For example, 'HTTP/1.0'.
headers
Holds an instance of the class specified by the MessageClass class variable. This instance parses and manages the headers in the HTTP request. The parse_headers() function from http.client is used to parse the headers and it requires that the HTTP request provide a valid RFC 2822 style header.
rfile
An io.BufferedIOBase input stream, ready to read from the start of the optional input data.
wfile
Contains the output stream for writing a response back to the client. Proper adherence to the HTTP protocol must be used when writing to this stream in order to achieve successful interoperation with HTTP clients.
Changed in version 3.6: This is an io.BufferedIOBase stream.
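To show how headers, rfile and wfile fit together, here is a minimal sketch of a handler that echoes a POST body back to the client. EchoHandler and post_once are hypothetical names, and the sketch assumes the client always sends a Content-Length header (chunked request bodies are not handled).

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer


class EchoHandler(BaseHTTPRequestHandler):
    """Hypothetical handler that echoes the request body."""

    def do_POST(self):
        # Read exactly as many bytes as the client announced.
        length = int(self.headers.get("Content-Length", 0))
        payload = self.rfile.read(length)
        self.send_response(200)
        self.send_header("Content-Type", "application/octet-stream")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)

    def log_message(self, format, *args):
        pass  # silence the default stderr logging for this example


def post_once(payload):
    """POST payload to a throwaway server and return (status, body)."""
    server = HTTPServer(("127.0.0.1", 0), EchoHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    conn = http.client.HTTPConnection("127.0.0.1", server.server_port, timeout=5)
    try:
        conn.request("POST", "/", body=payload)
        response = conn.getresponse()
        return response.status, response.read()
    finally:
        conn.close()
        server.shutdown()
        server.server_close()
```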
BaseHTTPRequestHandler has the following attributes:
server_version
Specifies the server software version. You may want to override this. The format is multiple whitespace-separated strings, where each string is of the form name[/version]. For example, 'BaseHTTP/0.2'.
sys_version
Contains the Python system version, in a form usable by the version_string method and the server_version class variable. For example, 'Python/1.4'.
error_message_format
Specifies a format string that should be used by the send_error() method for building an error response to the client. The string is filled by default with variables from responses based on the status code passed to send_error().
error_content_type
Specifies the Content-Type HTTP header of error responses sent to the client. The default value is 'text/html'.
protocol_version
Specifies the HTTP version to which the server is conformant. It is sent in responses to let the client know the server's communication capabilities for future requests. If set to 'HTTP/1.1', the server will permit HTTP persistent connections; however, your server must then include an accurate Content-Length header (using send_header()) in all of its responses to clients. For backwards compatibility, the setting defaults to 'HTTP/1.0'.
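A minimal sketch of persistent connections under these rules follows. KeepAliveHandler and two_requests are hypothetical names; the handler reports the client's source port so that reuse of a single connection becomes visible in the response bodies.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer


class KeepAliveHandler(BaseHTTPRequestHandler):
    """Hypothetical handler that opts in to HTTP/1.1."""

    protocol_version = "HTTP/1.1"

    def do_GET(self):
        # The body is the client's source port, taken from client_address.
        body = str(self.client_address[1]).encode("ascii")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        # An accurate Content-Length is required for connection reuse.
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # silence the default stderr logging for this example


def two_requests():
    """Send two GET requests over one HTTPConnection; return both bodies."""
    server = HTTPServer(("127.0.0.1", 0), KeepAliveHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    conn = http.client.HTTPConnection("127.0.0.1", server.server_port, timeout=5)
    try:
        bodies = []
        for _ in range(2):
            conn.request("GET", "/")
            bodies.append(conn.getresponse().read())
        return bodies
    finally:
        conn.close()  # close first so the handler's read loop can finish
        server.shutdown()
        server.server_close()
```

If both bodies are equal, the second request travelled over the same TCP connection as the first.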
MessageClass
Specifies an email.message.Message-like class to parse HTTP headers. Typically, this is not overridden, and it defaults to http.client.HTTPMessage.
responses
This attribute contains a mapping of error code integers to two-element tuples containing a short and long message. For example, {code: (shortmessage, longmessage)}. The shortmessage is usually used as the message key in an error response, and longmessage as the explain key. It is used by the send_response_only() and send_error() methods.
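The mapping can be inspected directly on the class. The sketch below uses a hypothetical helper, describe_status, and falls back to the '???' placeholder that send_error() is documented to use for unknown codes.

```python
from http.server import BaseHTTPRequestHandler


def describe_status(code):
    """Return (shortmessage, longmessage) for an HTTP status code.

    Hypothetical helper; unknown codes map to ('???', '???').
    """
    return BaseHTTPRequestHandler.responses.get(code, ("???", "???"))
```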
A BaseHTTPRequestHandler instance has the following methods:
handle()
Calls handle_one_request() once (or, if persistent connections are enabled, multiple times) to handle incoming HTTP requests. You should never need to override it; instead, implement appropriate do_*() methods.
handle_one_request()
This method will parse and dispatch the request to the appropriate do_*() method. You should never need to override it.
handle_expect_100()
When an HTTP/1.1 conformant server receives an Expect: 100-continue request header, it responds with a 100 Continue followed by 200 OK headers. This method can be overridden to raise an error if the server does not want the client to continue. For example, the server can choose to send 417 Expectation Failed as a response header and return False.
Added in version 3.2.
send_error(code, message=None, explain=None)
Sends and logs a complete error reply to the client. The numeric code specifies the HTTP error code, with message as an optional, short, human-readable description of the error. The explain argument can be used to provide more detailed information about the error; it will be formatted using the error_message_format attribute and emitted, after a complete set of headers, as the response body. The responses attribute holds the default values for message and explain that will be used if no value is provided; for unknown codes the default value for both is the string ???. The body will be empty if the method is HEAD or the response code is one of the following: 1xx, 204 No Content, 205 Reset Content, 304 Not Modified.
Changed in version 3.4: The error response includes a Content-Length header. Added the explain argument.
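A short sketch of send_error() in a handler is given below. ItemHandler, its ITEMS table and fetch are hypothetical; the point is only that a single call produces the status line, headers and an HTML body containing the explain text.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer


class ItemHandler(BaseHTTPRequestHandler):
    """Hypothetical handler backed by a tiny in-memory table."""

    ITEMS = {"/a": b"alpha"}

    def do_GET(self):
        body = self.ITEMS.get(self.path)
        if body is None:
            # One call sends the status line, headers and an HTML body.
            self.send_error(404, explain="No item is stored at this path")
            return
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # silence the default stderr logging for this example


def fetch(path):
    """GET path from a throwaway server and return (status, body)."""
    server = HTTPServer(("127.0.0.1", 0), ItemHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    conn = http.client.HTTPConnection("127.0.0.1", server.server_port, timeout=5)
    try:
        conn.request("GET", path)
        response = conn.getresponse()
        return response.status, response.read()
    finally:
        conn.close()
        server.shutdown()
        server.server_close()
```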
send_response(code, message=None)
Adds a response header to the headers buffer and logs the accepted request. The HTTP response line is written to the internal buffer, followed by Server and Date headers. The values for these two headers are picked up from the version_string() and date_time_string() methods, respectively. If the server does not intend to send any other headers using the send_header() method, then send_response() should be followed by an end_headers() call.
Changed in version 3.3: Headers are stored to an internal buffer and end_headers() needs to be called explicitly.
send_header(keyword, value)
Adds the HTTP header to an internal buffer which will be written to the output stream when either end_headers() or flush_headers() is invoked. keyword should specify the header keyword, with value specifying its value. Note that, after the send_header() calls are done, end_headers() MUST be called in order to complete the operation.
Changed in version 3.2: Headers are stored in an internal buffer.
send_response_only(code, message=None)
Sends the response header only; used when a 100 Continue response is sent by the server to the client. The headers are not buffered and are sent directly to the output stream. If the message is not specified, the HTTP message corresponding to the response code is sent.
end_headers()
Adds a blank line (indicating the end of the HTTP headers in the response) to the headers buffer and calls flush_headers().
Changed in version 3.2: The buffered headers are written to the output stream.
flush_headers()
Finally send the headers to the output stream and flush the internal headers buffer.
Added in version 3.3.
log_request(code='-', size='-')
Logs an accepted (successful) request. code should specify the numeric HTTP code associated with the response. If a size of the response is available, then it should be passed as the size parameter.
log_error(...)
Logs an error when a request cannot be fulfilled. By default, it passes the message to log_message(), so it takes the same arguments (format and additional values).
log_message(format, ...)
Logs an arbitrary message to sys.stderr. This is typically overridden to create custom error logging mechanisms. The format argument is a standard printf-style format string, where the additional arguments to log_message() are applied as inputs to the formatting. The client IP address and current date and time are prefixed to every message logged.
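One common reason to override log_message() is to route messages through the logging module. The sketch below assumes a logger named "example.access" (a made-up name); LoggingHandler and fetch_status are likewise hypothetical. Note that an override like this bypasses whatever the default implementation does to the message, so any sanitising is the caller's responsibility.

```python
import http.client
import logging
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

access_log = logging.getLogger("example.access")  # hypothetical logger name


class LoggingHandler(BaseHTTPRequestHandler):
    """Hypothetical handler that logs through the logging module."""

    def do_GET(self):
        body = b"ok"
        self.send_response(200)  # send_response() also logs the request
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        # Same printf-style contract as the default implementation.
        access_log.info("%s %s", self.address_string(), format % args)


def fetch_status():
    """GET / from a throwaway server and return the status code."""
    server = HTTPServer(("127.0.0.1", 0), LoggingHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    conn = http.client.HTTPConnection("127.0.0.1", server.server_port, timeout=5)
    try:
        conn.request("GET", "/")
        response = conn.getresponse()
        response.read()
        return response.status
    finally:
        conn.close()
        server.shutdown()
        server.server_close()
```

The application still has to configure the logger's level and handlers; without that, INFO records are normally discarded.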
version_string()
Returns the server software's version string. This is a combination of the server_version and sys_version attributes.
date_time_string(timestamp=None)
Returns the date and time given by timestamp (which must be None or in the format returned by time.time()), formatted for a message header. If timestamp is omitted, it uses the current date and time.
The result looks like 'Sun, 06 Nov 1994 08:49:37 GMT'.
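The header date format shown above can be reproduced outside a handler with email.utils.formatdate. This is a sketch under the assumption that the same format is wanted; http_date is a hypothetical helper and is not claimed to be how date_time_string() is implemented.

```python
import email.utils


def http_date(timestamp):
    """Format a time.time()-style timestamp as an HTTP date in GMT.

    Hypothetical helper built on email.utils.formatdate.
    """
    return email.utils.formatdate(timestamp, usegmt=True)
```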
log_date_time_string()
Returns the current date and time, formatted for logging.
address_string()
Returns the client address.
Changed in version 3.3: Previously, a name lookup was performed. To avoid name resolution delays, it now always returns the IP address.
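class http.server.SimpleHTTPRequestHandler(request, client_address, server, directory=None)
This class serves files from the directory directory and below, or the current directory if directory is not provided, directly mapping the directory structure to HTTP requests.
Changed in version 3.7: Added the directory parameter.
Changed in version 3.9: The directory parameter accepts a path-like object.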
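Since the server instantiates the handler class itself, one way to pass the directory parameter is functools.partial. The sketch below is illustrative only: QuietHandler, fetch_from_directory and demo are hypothetical names, and a temporary directory stands in for real content.

```python
import functools
import http.client
import pathlib
import tempfile
import threading
from http.server import HTTPServer, SimpleHTTPRequestHandler


class QuietHandler(SimpleHTTPRequestHandler):
    """Hypothetical subclass that only silences request logging."""

    def log_message(self, format, *args):
        pass


def fetch_from_directory(directory, path):
    """Serve directory on a free port, GET path, return (status, body)."""
    # partial() pre-binds directory; the server supplies the other arguments.
    handler = functools.partial(QuietHandler, directory=directory)
    server = HTTPServer(("127.0.0.1", 0), handler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    conn = http.client.HTTPConnection("127.0.0.1", server.server_port, timeout=5)
    try:
        conn.request("GET", path)
        response = conn.getresponse()
        return response.status, response.read()
    finally:
        conn.close()
        server.shutdown()
        server.server_close()


def demo():
    # Create a throwaway directory holding a single small file.
    with tempfile.TemporaryDirectory() as tmp:
        pathlib.Path(tmp, "hello.txt").write_bytes(b"hi")
        return fetch_from_directory(tmp, "/hello.txt")
```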
A lot of the work, such as parsing the request, is done by the base class BaseHTTPRequestHandler. This class implements the do_GET() and do_HEAD() functions.
The following are defined as class-level attributes of SimpleHTTPRequestHandler:
server_version
This will be "SimpleHTTP/" + __version__, where __version__ is defined at the module level.
extensions_map
A dictionary mapping suffixes into MIME types; it contains custom overrides for the default system mappings. The mapping is used case-insensitively, and so should contain only lower-cased keys.
Changed in version 3.9: This dictionary is no longer filled with the default system mappings, but only contains overrides.
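A subclass can extend the mapping without mutating the dictionary shared by the base class. In this sketch, MarkdownAwareHandler and the '.md' to 'text/markdown' entry are illustrative assumptions.

```python
from http.server import SimpleHTTPRequestHandler


class MarkdownAwareHandler(SimpleHTTPRequestHandler):
    """Hypothetical subclass that adds one lower-cased suffix override."""

    # Copy the base mapping so the parent class's dict stays untouched.
    extensions_map = {
        **SimpleHTTPRequestHandler.extensions_map,
        ".md": "text/markdown",
    }
```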
The SimpleHTTPRequestHandler class defines the following methods:
do_HEAD()
This method serves the 'HEAD' request type: it sends the headers it would send for the equivalent GET request. See the do_GET() method for a more complete explanation of the possible headers.
do_GET()
The request is mapped to a local file by interpreting the request as a path relative to the current working directory.
If the request was mapped to a directory, the directory is checked for a file named index.html or index.htm (in that order). If found, the file's contents are returned; otherwise a directory listing is generated by calling the list_directory() method. This method uses os.listdir() to scan the directory, and returns a 404 error response if the listdir() fails.
If the request was mapped to a file, it is opened. Any OSError exception in opening the requested file is mapped to a 404, 'File not found' error. If there was an 'If-Modified-Since' header in the request, and the file was not modified after this time, a 304, 'Not Modified' response is sent. Otherwise, the content type is guessed by calling the guess_type() method, which in turn uses the extensions_map variable, and the file contents are returned.
A 'Content-type:' header with the guessed content type is output, followed by a 'Content-Length:' header with the file's size and a 'Last-Modified:' header with the file's modification time.
Then follows a blank line signifying the end of the headers, and then the contents of the file are output. If the file's MIME type starts with text/ the file is opened in text mode; otherwise binary mode is used.
For example usage, see the implementation of the test function in Lib/http/server.py.
Changed in version 3.7: Support of the 'If-Modified-Since' header.
The SimpleHTTPRequestHandler class can be used in the following manner in order to create a very basic webserver serving files relative to the current directory:
import http.server
import socketserver

PORT = 8000

Handler = http.server.SimpleHTTPRequestHandler

with socketserver.TCPServer(("", PORT), Handler) as httpd:
    print("serving at port", PORT)
    httpd.serve_forever()
SimpleHTTPRequestHandler can also be subclassed to enhance behavior, such as using different index file names by overriding the class attribute index_pages.
http.server can also be invoked directly using the -m switch of the interpreter. Similar to the previous example, this serves files relative to the current directory:
python -m http.server
The server listens on port 8000 by default. The default can be overridden by passing the desired port number as an argument:
python -m http.server 9000
By default, the server binds itself to all interfaces. The option -b/--bind specifies a specific address to which it should bind. Both IPv4 and IPv6 addresses are supported. For example, the following command causes the server to bind to localhost only:
python -m http.server --bind 127.0.0.1
Changed in version 3.4: Added the --bind option.
Changed in version 3.8: Support IPv6 in the --bind option.
By default, the server uses the current directory. The option -d/--directory specifies a directory from which it should serve the files. For example, the following command uses a specific directory:
python -m http.server --directory /tmp/
Changed in version 3.7: Added the --directory option.
By default, the server is conformant to HTTP/1.0. The option -p/--protocol specifies the HTTP version to which the server is conformant. For example, the following command runs an HTTP/1.1 conformant server:
python -m http.server --protocol HTTP/1.1
Changed in version 3.11: Added the --protocol option.
class http.server.CGIHTTPRequestHandler(request, client_address, server)
This class is used to serve either files or output of CGI scripts from the current directory and below. Note that mapping HTTP hierarchic structure to local directory structure is exactly as in SimpleHTTPRequestHandler.
Note
CGI scripts run by the CGIHTTPRequestHandler class cannot execute redirects (HTTP code 302), because code 200 (script output follows) is sent prior to execution of the CGI script. This pre-empts the status code.
The class will, however, run the CGI script instead of serving it as a file if it guesses it to be a CGI script. Only directory-based CGI is used; the other common server configuration is to treat special extensions as denoting CGI scripts.
The do_GET() and do_HEAD() functions are modified to run CGI scripts and serve the output, instead of serving files, if the request leads to somewhere below the cgi_directories path.
The CGIHTTPRequestHandler defines the following data member:
cgi_directories
This defaults to ['/cgi-bin', '/htbin'] and describes directories to treat as containing CGI scripts.
The CGIHTTPRequestHandler defines the following method:
do_POST()
This method serves the 'POST' request type, only allowed for CGI scripts. Error 501, "Can only POST to CGI scripts", is output when trying to POST to a non-CGI URL.
Note that CGI scripts will be run with UID of user nobody, for security reasons. Problems with the CGI script will be translated to error 403.
CGIHTTPRequestHandler can be enabled in the command line by passing the --cgi option:
python -m http.server --cgi
CGIHTTPRequestHandler and the --cgi command-line option are not intended for use by untrusted clients and may be vulnerable to exploitation. Always use within a secure environment.
SimpleHTTPRequestHandler will follow symbolic links when handling requests, which makes it possible for files outside of the specified directory to be served.
Earlier versions of Python did not scrub control characters from the log messages emitted to stderr from python -m http.server or the default BaseHTTPRequestHandler.log_message implementation. This could allow remote clients connecting to your server to send nefarious control codes to your terminal.
Changed in version 3.12: Control characters are scrubbed in stderr logs.