I am using xlrd
to process Excel files. I am running a script on a folder that contains many files, and I am printing messages related to the files. However, fo
Based on John Machin's answer, here is some working code (tested with Python 3.6) that works with sys.stdout
:
import io
import sys
import xlrd
path = "somefile.xls" # path to an XLS file to load or a file-like object for one
encoding_override = None
# Mute xlrd warnings for OLE inconsistencies
class LogFilter(io.TextIOWrapper):
def __init__(self, buffer=sys.stdout, *args, **kwargs):
self.buffer = buffer
super(LogFilter, self).__init__(buffer, *args, **kwargs)
def write(self, data):
if isinstance(data, str):
if not data.startswith("WARNING *** OLE2 inconsistency: "):
super(LogFilter, self).write(data)
elif isinstance(data, bytes):
super(LogFilter, self).write(data.decode(self.buffer.encoding))
else:
super(LogFilter, self).write(data)
def open_workbook(file_contents, encoding_override):
logfilter = LogFilter()
return xlrd.open_workbook(file_contents=file_contents, logfile=logfilter,
encoding_override=encoding_override)
if hasattr(path, 'read'):
book = open_workbook(file_contents=path.read(), encoding_override=encoding_override)
else:
with open(path, 'rb') as f:
book = open_workbook(file_contents=f.read(), encoding_override=encoding_override)
Check out the relevant part of the xlrd docs. The 2nd arg of the open_workbook
function is logfile
which should be an open file object or act-alike. All it needs to support is a write
method. It defaults to sys.stdout
.
So, something like this (untested) should do the job:
class MyFilter(object):
def __init__(self, mylogfile=sys.stdout):
self.f = mylogfile
def write(self, data):
if "WARNING *** OLE2 inconsistency" not in data:
self.f.write(data)
#start up
log = open("the_log_file.txt", "w")
log_filter = MyFilter(log)
book = xlrd.open_workbook("foo.xls", logfile=log_filter)
# shut down
log.close()
# or use a "with" statement
Update in response to answer by @DaniloBargen:
It's not xlrd
that's writing the newline separately, it's the Python print
statement/function. This script:
class FakeFile(object):
def write(self, data):
print repr(data)
ff = FakeFile()
for x in "foo bar baz".split():
print >> ff, x
produces this output for all Pythons 2.2 to 2.7 both inclusive:
'foo'
'\n'
'bar'
'\n'
'baz'
'\n'
A suitably modernised script (print as a function instead of a statement) produces identical output for 2.6, 2.7, 3.1, 3.2, and 3.3. You can work around this with a more complicated filter class. The following example additionally allows a sequence of phrases to be checked for:
import sys, glob, xlrd
class MyFilter(object):
def __init__(self, mylogfile=sys.stdout, skip_list=()):
self.f = mylogfile
self.state = 0
self.skip_list = skip_list
def write(self, data):
if self.state == 0:
found = any(x in data for x in self.skip_list)
if not found:
self.f.write(data)
return
if data[-1] != '\n':
self.state = 1
else:
if data != '\n':
self.f.write(data)
self.state = 0
logf = open("the_log_file.txt", "w")
skip_these = (
"WARNING *** OLE2 inconsistency",
)
try:
log_filter = MyFilter(logf, skip_these)
for fname in glob.glob(sys.argv[1]):
logf.write("=== %s ===\n" % fname)
book = xlrd.open_workbook(fname, logfile=log_filter)
finally:
logf.close()
The answer by John works, but has a small problem:
xlrd writes that warning message and the following newline character separately to the logfile. Therefore you will get an empty line in your stdout instead of the message, if you use the filter class proposed by John. You shouldn't simply filter out all newlines from the log output either though, because there could be "real" warnings that would then be missing the newlines.
If you want to simply ignore all log output by xlrd, this is probably the simplest solution:
book = xlrd.open_workbook("foo.xls", logfile=open(os.devnull, 'w'))
For what it's worth I had the same warning message; when I deleted the first row (which was empty) the warning disappeared.
import warnings
def fxn():
warnings.warn("deprecated", DeprecationWarning)
with warnings.catch_warnings():
warnings.simplefilter("ignore")
fxn()
-> http://docs.python.org/library/warnings.html#temporarily-suppressing-warnings