Memory leak #66

@GoogleCodeExporter

What steps will reproduce the problem?
1. Run a large number of documents through python-tesseract, calling GetUTF8Text on each one

What is the expected output? What do you see instead?
Normal output for each document (either extracted text or an ordinary error).

Instead, a double free or corruption error is reported and Python aborts:

Error in boxClipToRectangle: box outside rectangle
Error in pixScanForForeground: invalid box
*** Error in `/usr/bin/python': double free or corruption (!prev): 0x0000000007ba4da0 ***
Fatal Python error: Aborted

Current thread 0x00007fcacaa0e740:
  File "/usr/local/lib/python2.7/dist-packages/python_tesseract-0.9-py2.7-linux-x86_64.egg/tesseract.py", line 10484 in GetUTF8Text


What version of the product are you using? On what operating system?
Ubuntu 14.04, Python 2.7.8, python-tesseract 0.8-19, tesseract 3.03, leptonica-1.70
libgif 4.1.6(?) : libjpeg 8d : libpng 1.2.50 : libtiff 4.0.3 : zlib 1.2.8 : webp 0.4.0

Please provide any additional information below.
This happens across a few different types of document extraction tasks. When
python-tesseract is run over a long list of documents, it typically first logs
some errors like the ones above (which normally don't cause a crash), and then
a double free or corruption error aborts Python and kills any associated worker.
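
Until the underlying crash is fixed, one possible way to keep a single bad document from taking down the whole worker is to run each extraction in its own child process and check how it exited. The following is a minimal sketch of that idea, not a fix for the double free itself. It assumes Python 3 and only the standard library. The names `ocr_isolated` and `CHILD_SCRIPT` are hypothetical, and the child script is a stand-in that simulates the abort with `os.abort()` rather than calling python-tesseract.

```python
import subprocess
import sys

# Hypothetical stand-in for the real OCR step: it aborts on inputs whose
# file name starts with "bad" to mimic the native crash, and echoes the
# path for everything else. Replace with the actual extraction code.
CHILD_SCRIPT = r"""
import os, sys
path = sys.argv[1]
if os.path.basename(path).startswith("bad"):
    os.abort()  # simulates the process being aborted by native code
sys.stdout.write("text from " + path)
"""


def ocr_isolated(path, script=CHILD_SCRIPT, timeout=60):
    """Run the extraction for one document in a separate interpreter.

    Returns (text, None) on success, or (None, reason) if the child
    crashed, exited with an error, or timed out.
    """
    try:
        proc = subprocess.run(
            [sys.executable, "-c", script, path],
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            timeout=timeout,
        )
    except subprocess.TimeoutExpired:
        return None, "timeout"
    if proc.returncode != 0:
        # On POSIX, a negative return code means the child was killed
        # by a signal (an abort shows up as SIGABRT).
        return None, "child exited with code %d" % proc.returncode
    return proc.stdout.decode("utf-8"), None


if __name__ == "__main__":
    for doc in ["good1.png", "bad2.png", "good3.png"]:
        text, err = ocr_isolated(doc)
        print(doc, "->", text if err is None else "FAILED: " + err)
```

Starting one interpreter per document is slow, so in practice it may make more sense to batch several documents per child and retry a failed batch one document at a time.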

Original issue reported on code.google.com by ch...@fiscalnote.com on 2 Oct 2014 at 8:02
