Hi there !
Could somebody help me please ?
I try to do extract text for a document with FineReader.
It works fine with black text on white background.
The problem is that some parts of the document have black text on a kind of
gray background.
In fact this gray background is made of small black dots, like the text, and
the OCR can get it.
Is it possible to remove the background with Photoshop to have only the text
left ?
Hope my english is understandable.
Thanks.
Ben.
Ben wrote:
> Hi there !
>
> Could somebody help me please ?
>
> I try to do extract text for a document with FineReader.
> It works fine with black text on white background.
> The problem is that some parts of the document have black text on a kind o
f
> gray background.
> In fact this gray background is made of small black dots, like the text, a
nd
> the OCR can get it.
>
> Is it possible to remove the background with Photoshop to have only the te
xt
> left ?
>
> Hope my english is understandable.
>
> Thanks.
>
> Ben.
>
>
Make a mask of just the text minus the background in PS. If the image,
zoomed at 100% looks destinct (color difference between the gray and
black) enough, then use either the color range tool or if the colors are
too similar, then use the path tool to do it by hand. Once you have
selected the text that you want, click CMND or CTRL J to create a new
layer of just your selection which will be that text you want. Once
done, go back to the original layer, make sure that nothing is selected
and fill it in with pure white. Flatten the layer and try your OCR
software with that.
--
__________________________________________________
Leo McKenzie
www.solocomputerservices.com/scsgrafx.php
"L. McKenzie" <customersupport@solocomputerservices.com> a écrit dans le
message de news:PWkvc.35414$zO3.23147@newsread2.news.atl.earthlink.net...
> Ben wrote:
>
of
and
text
> Make a mask of just the text minus the background in PS. If the image,
> zoomed at 100% looks destinct (color difference between the gray and
> black) enough, then use either the color range tool or if the colors are
> too similar, then use the path tool to do it by hand. Once you have
> selected the text that you want, click CMND or CTRL J to create a new
> layer of just your selection which will be that text you want. Once
> done, go back to the original layer, make sure that nothing is selected
> and fill it in with pure white. Flatten the layer and try your OCR
> software with that.
>
> --
>
Text hasn't been written in Photoshop.
It's a scanned document.
How can I make a mak of just the test minus the background ?
"Ben" <sem_lotr@hotmail.com> writes:
> Hi there !
>
> Could somebody help me please ?
>
> I try to do extract text for a document with FineReader.
> It works fine with black text on white background.
> The problem is that some parts of the document have black text on a kind o
f
> gray background.
> In fact this gray background is made of small black dots, like the text, a
nd
> the OCR can get it.
>
> Is it possible to remove the background with Photoshop to have only the te
xt
> left ?
What you really need is "descreening". Perhaps your scanner has this
option. If not, you can try a Gaussian blur that will turn the dots into
a bumpy gray color, then a threshold operation so that the gray background
goes to white, while the text goes to black. Of course, you will be removing
some detail from the text, perhaps enough to break the OCR.
--
-Stephen H. Westin
Any information or opinions in this message are mine: they do not
represent the position of Cornell University or any of its sponsors.
Ben wrote:
> "L. McKenzie" <customersupport@solocomputerservices.com> a écrit dans le
> message de news:PWkvc.35414$zO3.23147@newsread2.news.atl.earthlink.net...
>
>
> of
>
>
> and
>
>
> text
>
>
>
> Text hasn't been written in Photoshop.
> It's a scanned document.
>
> How can I make a mak of just the test minus the background ?
>
>
>
It won't matter. It is an image.
--
__________________________________________________
Leo McKenzie
www.solocomputerservices.com/scsgrafx.php
Ben wrote:
> "L. McKenzie" <customersupport@solocomputerservices.com> a écrit dans le
> message de news:PWkvc.35414$zO3.23147@newsread2.news.atl.earthlink.net...
>
>
> of
>
>
> and
>
>
> text
>
>
>
> Text hasn't been written in Photoshop.
> It's a scanned document.
>
> How can I make a mak of just the test minus the background ?
>
>
>
Ben.. Do a google search for making masks in Photoshop or use the tip
that Stephen posted. Both will work for you. Thanks.
--
__________________________________________________
Leo McKenzie
www.solocomputerservices.com/scsgrafx.php