How to scan a document so that you can edit it: converting a scanned document to Word format

Storing scanned documents on a computer's hard disk or on external media is convenient and safe. But how do you make changes to pages that are usually stored as images? You will need special programs, and their installation and use are described below.

How to scan a document before editing

To work with the file successfully later, it is important to convert the page to an image properly and to keep a few simple but useful points in mind during scanning:

Smooth out all creases and folds so that they do not show up on the scan and make the letters harder to recognize.
For convenience, save the file in PDF, JPG or TIFF format.
A PDF document can be opened and edited in Adobe Acrobat (or any other program intended for this purpose).
Visit the scanner manufacturer's website, or look for the bundled program on the supplied disc (well-known brands often have their own applications for editing scanned pages).
To use the file later in MS Office 2003 or 2007, install the Microsoft Office Document Scanning utility. It converts the scanned file automatically, turning it straight into text (the utility does not work with newer versions of Office).
Scan in black and white rather than in color: this simplifies text analysis.
TIFF is the best format for OCR converters, that is, programs that perform optical character recognition.
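
Why black and white helps can be shown with a toy thresholding step. This is only a sketch under stated assumptions: the page is represented as nested lists of grayscale values from 0 to 255, and the function name `binarize` and the threshold of 128 are illustrative choices, not what any scanner driver actually does.

```python
def binarize(gray_rows, threshold=128):
    """Map each grayscale pixel (0-255) to 0 (ink) or 1 (paper)."""
    return [[1 if pixel >= threshold else 0 for pixel in row] for row in gray_rows]


# A tiny made-up "scan": dark values are ink, light values are paper.
page = [
    [250, 245, 30, 240],
    [248, 20, 25, 242],
    [251, 249, 35, 247],
]

bw = binarize(page)
for row in bw:
    # Draw ink as '#' and paper as '.' to make the shape visible.
    print("".join("#" if pixel == 0 else "." for pixel in row))
```

After this step every pixel is either ink or paper, which leaves a recognition program with a much simpler decision than a full-color image would.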

How to edit a scanned document - Working with OCR utilities

Optical character recognition (OCR) works by reading the characters in the page image and comparing them with elements from the program's own database. In this way a flat picture is converted into editable text. Notable examples of programs that handle this task are Adobe Acrobat and Evernote. To correct an existing scan, simply open it in one of these applications; the rest of the process happens automatically. When the program finishes recognition, it offers to save the document in one of the available formats.
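
The idea of comparing characters with a database can be sketched in a few lines. This is a deliberately simplified illustration, not how Adobe Acrobat or any real OCR engine is implemented: the 3x5 glyph templates, the `TEMPLATES` table and the function names are all hypothetical.

```python
# Hypothetical 3x5 glyph templates: '#' is ink, '.' is paper.
TEMPLATES = {
    "I": ["###", ".#.", ".#.", ".#.", "###"],
    "L": ["#..", "#..", "#..", "#..", "###"],
    "T": ["###", ".#.", ".#.", ".#.", ".#."],
}


def distance(glyph, template):
    """Count the pixels where the scanned glyph and the template differ."""
    return sum(
        g != t
        for glyph_row, template_row in zip(glyph, template)
        for g, t in zip(glyph_row, template_row)
    )


def recognize(glyph):
    """Return the template character with the fewest differing pixels."""
    return min(TEMPLATES, key=lambda char: distance(glyph, TEMPLATES[char]))


# A scanned "T" with one noisy pixel in the fourth row.
scanned = ["###", ".#.", ".#.", "##.", ".#."]
print(recognize(scanned))
```

The noisy glyph still lands on the closest template, which is why creases, folds and stray marks matter: every extra wrong pixel pushes the glyph closer to some other character.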

How to edit a scanned PDF document

If the scanned document is saved as a PDF file, it can easily be edited in Acrobat DC:

open the menu "Tools" -> "Edit PDF";
the program starts the editing process and shows a prompt in the upper right corner;
click the prompt and select "Parameters" to specify the recognition language;
to make changes, just click on any line of the document;
a document opened for editing through OCR is accompanied by a settings panel on the right side of the screen;
in the "Settings" section, besides the language, you can also choose the displayed font and mark the pages to be edited (all or one).

There is also an accessible online alternative to installed converter programs: online OCR services that readily convert an image into a text format. For example, the PDFonline.com website can turn a scanned PDF document into an ordinary MS Word file in a few minutes.

If you chose the quick way of writing the theoretical chapter described in section 2.1, you will most likely not be able to do without scanning documents. Otherwise you can skip this section and start working through the materials you found in the library.

Before you start scanning, decide exactly what you want to use in your paper. To do this, first look through the literature you have and mark the relevant passages in pencil.

When I scanned a magazine article for my first coursework, the task seemed unimaginably difficult. After several hours with the scanner and FineReader, I ended up with gibberish that could not be edited, and I had to retype everything by hand. So that this does not happen to you, let us go through all the technical details of scanning.

To scan, we will of course need a scanner. You do not have to buy one: you can, for example, borrow one from a friend for a while. I use the CanoScan LiDE 60. It is not the newest model, but I really like this compact, fast and convenient device. If you have borrowed the scanner, you must first install its driver before it will work. The drivers and an installation guide can always be found on the installation disc supplied with the device or downloaded from the manufacturer's website. After installing the driver, connect the scanner to the computer with its cable. Now you can proceed directly to scanning.

But first, a little theory. The scanning process consists of two stages:

1. Scanning the document itself. At this stage the scanner in effect photographs the surface of the document and saves the resulting image to the computer as an ordinary file in .jpg, .gif or another format;

2. Recognizing the document. This is the process of converting the text in the scanner's image into ordinary text, which can then be saved in Word and edited. Recognition is done without the scanner, using a special program (the most popular is ABBYY FineReader). This means you can first scan several sheets of text, save them as images, and only then convert them into text.
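
Because the two stages are independent, the images saved in stage one can be collected later and handed to the recognition program in one batch. The sketch below only gathers the files in page order; the folder layout, the file names and the `pending_scans` helper are assumptions made for illustration, and a temporary folder stands in for a real scan folder.

```python
import tempfile
from pathlib import Path

# Image formats mentioned in this article; extend the set as needed.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".gif", ".tif", ".tiff", ".png", ".bmp"}


def pending_scans(folder):
    """List image files in a folder, sorted by name, ready for stage two."""
    return sorted(
        path
        for path in Path(folder).iterdir()
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES
    )


# Demonstration with empty placeholder files in a temporary folder.
with tempfile.TemporaryDirectory() as folder:
    for name in ["page_02.jpg", "page_01.JPG", "notes.txt"]:
        (Path(folder) / name).touch()
    names = [path.name for path in pending_scans(folder)]

print(names)
```

Naming the files with zero-padded page numbers keeps them in reading order when they are sorted by name.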

So, let's begin with stage one, scanning:

- Start the scanner driver: Start - All Programs - Canon - ScanGear (use the driver name for your own scanner). The driver window appears:

- Open the scanner lid and place the book, magazine or photocopy text side down, as straight as possible relative to the edges of the scanner glass:

It is very important that the scanner lid presses the document as tightly as possible, so that no outside light reaches the glass surface in contact with the document;

- Make the necessary settings in the scanner driver. First of all, set the resolution at which the document will be scanned. Resolution determines the level of detail captured during scanning and is measured in dots per inch (DPI). The higher the resolution, the better the image. For text documents, however, there is no point in setting the maximum resolution, because it brings no benefit, and scanning at a high resolution takes longer. I recommend a resolution of 400-500 DPI. At this setting the image is good enough for reliable recognition, and the scan itself does not take long. Here is a screenshot of the settings of my scanner:

First switch to "Advanced Mode". The source will always be "Platen" (a flatbed scanner). It is better to set the color mode to "Black and White": we do not need color to scan text, and this reduces the size of the output images. The resolution, as I already said, should be set to 400 DPI. The output image size must be "A4". Now you can press the "Scan" button. My scanner first stores the scanned images in its internal memory and offers to save them to the computer only when the driver window is closed. All I have to do is specify where the results will be saved.
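
To get a feel for what these settings produce, the arithmetic can be worked through in a short sketch. It assumes an A4 page of 210 x 297 mm, 1 bit per pixel for black and white and 24 bits per pixel for color; the function names are illustrative, and the sizes are for uncompressed data, so real JPG or TIFF files will usually be smaller.

```python
MM_PER_INCH = 25.4


def scan_dimensions(width_mm, height_mm, dpi):
    """Return the pixel width and height of a page scanned at the given DPI."""
    return (
        round(width_mm / MM_PER_INCH * dpi),
        round(height_mm / MM_PER_INCH * dpi),
    )


def raw_size_bytes(width_px, height_px, bits_per_pixel):
    """Uncompressed size, with each row padded up to a whole number of bytes."""
    row_bytes = (width_px * bits_per_pixel + 7) // 8
    return row_bytes * height_px


width, height = scan_dimensions(210, 297, 400)   # A4 at 400 DPI
bw_bytes = raw_size_bytes(width, height, 1)      # black and white, 1 bit per pixel
color_bytes = raw_size_bytes(width, height, 24)  # 24-bit color

print(f"{width} x {height} pixels")
print(f"black and white: {bw_bytes / 1e6:.1f} MB, color: {color_bytes / 1e6:.1f} MB")
```

At 400 DPI an A4 page comes out at 3307 x 4677 pixels, and the uncompressed black-and-white image is 24 times smaller than the color one, which is the size saving mentioned above.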

You should end up with files like this:

When you zoom in on such an image, the text should be clearly legible.

Stage two: recognizing the images and converting them into text. As I said, this requires a special program, FineReader. Download the program from this link (32 MB). The archive password is "site". The version I offer is portable and does not require installation. The program folder contains many files, but you need only one: FineReader.exe. Double-click this file to launch the program.

This version of the program is fairly old. All the screenshots below were made with it. If this version of FineReader does not start on your computer, choose a newer one.

The FineReader window looks like this:

Once you have set the language in which the scanned documents are printed, you can begin recognition. If the text contains two languages (for example, Russian and English), select both.

To start recognition, click the arrow to the right of the first button, Scan, and then choose Open Image:

The image selection window opens. Open the folder where you saved the scanned images, press Ctrl + A (with the English keyboard layout) and click the Open button.

After that, thumbnails of the added files appear on the left of the FineReader window, the currently selected image is shown enlarged in the center, an even larger view is shown at the bottom, and the recognition result is on the right:

For this example I took only two images. In the screenshot above the first of them is selected, so it will be recognized first. As you can see, the image was scanned sideways, so before the text can be recognized the picture must be rotated by 90 degrees, using the rotation buttons on the toolbar. Next, tell the program which part of the image to recognize and what type of data the output should contain: text, a table or a picture. There is a separate toolbar button for each. For example, to mark a text block, click the text-block button, then press the left mouse button at the upper left corner of the block and, holding the button down, drag to the lower right corner. As an example, I fully prepared one image for recognition:

As you can see, all text blocks in the example above are outlined in green and pictures in red. Tables are prepared for recognition in the same way, using the table button. To go to the next image, left-click its thumbnail on the left. All the images obtained from scanning are prepared for recognition in this way. Once the images are prepared, select them all: left-click an empty spot in the thumbnail panel (it is called the Batch) and press Ctrl + A (with the English keyboard layout). Then click the recognition button and wait until FineReader converts the images to text. After that you can save the resulting text to Word with the save button, which opens a dialog. In it, select Microsoft Word as the format and tick the option to save all pages:

After you click OK, the program creates a Word document and inserts the text of the recognized pages in the order in which they appear in the thumbnail panel (the Batch). Be sure to save the document immediately to the appropriate folder in your thesis file structure; then you can proceed to editing. How that is done is described in my free course.

One last point. If you scanned a newspaper or magazine, the text is often laid out in columns (as in the example above). In Word these columns need to be merged into one. Select the text laid out in columns and run the command Format - Columns - One - OK. Only after that can you set portrait orientation in the page settings, the margins, the font and so on.

How to scan a document and convert it to MS Word

When working with paper documents, manuscripts or books, you often need to convert everything into electronic form. This opens up many more possibilities and makes editing much easier. With a scanner or a high-resolution digital camera this is not difficult, but then the question arises: how do you convert a scanned document to Word format? To avoid retyping everything by hand, use specialized software.

Software solutions for converting scanned documents

This task should not cause difficulties. Modern programs let you edit part of a scanned document or convert the whole of it into the convenient Word format, and this can be done in just a few minutes.

Tip: with a fast Internet connection you can easily find a suitable program for editing scanned documents. You can also use online text recognition services.

Popular programs for this kind of work include:

1. ABBYY FineReader (including online);

3. Readiris Pro;

6. Online OCR Convert service, etc.

Rich functionality and ease of use make them quite popular. Both ordinary users and businesses value their reliability and performance. Even an inexperienced person can quickly work out how to scan a document into Word.

Text Recognition and Scanned Document Conversion

Usually you have to deal with pictures in .jpg, .tiff, .png or .bmp format, produced by scanning or photographing. How do you convert such a document to Word for further work? The text in an image cannot be edited in the usual ways. Some scanners support automatic conversion to .pdf, but the possibilities are still limited.

To get a proper text document, load the file into the program through its dialog (click "Open" or "Download"). To increase accuracy, you can specify a page range and select a particular area containing text. After a while a preliminary result appears. All that remains is to save the file as .doc and then edit the scanned document in MS Word.

How to edit a scanned document: two ways to handle the task

Today's topic is a very interesting one, at least for me: how to edit a scanned document.

To be honest, it is not as simple a topic as it may seem at first glance. The answer depends largely on the document itself and on the result you need.

In fact, there are two ways to edit a scanned document.

Conversion to text format

As you know, a scanned document is an image file in PNG or JPG (JPEG) format. Simply put, it is an ordinary picture.

If the scanned document contains ordinary text on a white background and you want to change the content of the text, the best option is to convert it to a text format.

After that, edit the text and save the file in Word or plain text format. Then, if necessary, turn the electronic file back into paper by printing it.

I recently wrote a detailed article about this.

I see no point in repeating it here; anyone who does not know how to do it can read my detailed instructions.

I will only say that once you have copied the text from the picture into, for example, Google Docs, you can edit the content of the document however you need.

How to edit a scanned document in Photoshop

The second way, and in my opinion the most interesting, is to edit the document in Photoshop.

In principle, a scanned document can be edited in any graphics editor, but Photoshop is, in my opinion, the most convenient and versatile, and simply the editor I am used to.

In Photoshop you can do whatever you like: move objects from place to place, shift a signature, add a stamp, remove unwanted words or add new ones.

You can change the color of any object and apply corrections to the document, that is, whiten the background or make faded text darker.

By the way, I have written quite a few articles about working with images in Photoshop.

Getting to know the tools

If you have Photoshop or any other graphics editor on your computer, you can perform the simplest operations yourself.

For example:

Lighten the background and darken the text;
Erase unwanted details;
Add text;
Edit the content, and so on.

However, if finer, one might say jeweler's, work is required, it is better to turn to a professional. I should mention that I successfully carry out work of this kind at any level of complexity.

You can submit a request. Meanwhile, let us continue the lesson. I will introduce the most essential tools that can be useful when working with a scanned document.

I will not simply list the tools; that would be a waste of time, since everything is labeled in Photoshop. Look at the left panel: that is where the tools are.

Hover the cursor over a tool and you will see its name in a tooltip. If you click the triangle in the corner of a tool, you will see several similar tools to choose from.

Working out what each tool does is not difficult; the names tell you. The Eraser erases everything it passes over, the Brush paints, the Pencil writes and draws lines.

The selection tools select objects so that they can be moved to another layer or simply shifted in the desired direction.

The Stamp copies a sampled area to a new place, and so on. When you click a tool, its settings appear at the top.

There you can choose, for example for a brush:

The size;
Softness or hardness;
Opacity;
Pressure, and so on.

As you can imagine, it is impossible to describe all the features of Photoshop in one article; that is material for a large series of lessons.

However, purely intuitively, by trial and error so to speak, you can apply the necessary tools to edit a scanned document.

Top panel

The top panel is also important when processing images. For example, open the "Image" menu and you will see what can be applied to the picture.

For example:

Manual or automatic correction;
Changing the size of the picture or the canvas;
Rotation and mirroring;
Cropping and trimming, and so on.

It is possible to edit scanned documents without deep knowledge of the graphics editor, but this is unlikely to produce the desired result.

If you still decide to take this step, I advise you to make a duplicate of the document, just in case. And when editing, do not forget to create a copy of the layer. Then any changes can be removed together with the copy of the layer.

How to edit a scanned document: an example

Here is an example of editing a scanned document in Photoshop.

Suppose that digits or letters in the document need to be swapped; which of the two does not matter.

To do this, I choose the Rectangular Marquee tool, select the digit I need and copy it to a new layer.

Then I use the Move tool to put it in the right place.

I merge the layers and save the result. In the screenshot below you can see how I changed one digit in the code.

Try doing the same on any document or picture.

Do not be discouraged if it does not work the first time. A good result requires knowledge, skill and experience.

So the more often you practice, the faster you will learn to work in Photoshop. Good luck!

Text recognition programs convert photographed or scanned documents directly into editable text.

The point is that text in an image is stored as a raster, a set of dots. Recognition software converts that set of dots into real text that can be edited and saved.

Character recognition is designed to speed up the digitization of printed or handwritten paper books and documents.

This method of digitization is orders of magnitude faster than retyping from the image by hand. It is widely used for digitizing libraries and archives. Below we look at five of the best programs of this kind.

ABBYY FineReader 10.

FineReader is the undisputed leader among programs that recognize text in images. In particular, no other software handles Cyrillic more accurately. In total FineReader supports 179 languages, and text in them is recognized extremely well.

The only thing that may disappoint users is that the program is paid. Only a 15-day trial version is distributed free of charge, and it allows 50 pages to be scanned during that period.

After that you have to pay to keep using the program. FineReader easily handles any image of more or less decent quality. The source does not matter at all: a photo, a scanned page or any picture with letters.

Advantages:

accurate recognition;
a huge number of read languages;
tolerance to the quality of the source image.

Disadvantages:

trial version for 15 days.

OCR CuneiForm

A free program for reading text from images. Its recognition accuracy is far lower than that of the previous program, but for a free utility its functionality is still impressive.

Interesting! CuneiForm recognizes text blocks, pictures and even various tables. Moreover, even tables without ruled lines can be read.

To improve recognition accuracy, the program uses special dictionaries, which are replenished with vocabulary from the scanned documents.
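
How a dictionary can help is easy to show with Python's standard `difflib` module. This is only an illustration of the general idea, not CuneiForm's actual algorithm: the word list, the `correct_word` helper and the similarity cutoff of 0.8 are assumptions made for the example.

```python
import difflib

# Hypothetical word list; a real OCR program ships far larger dictionaries.
DICTIONARY = ["document", "scanner", "recognition", "text", "image"]


def correct_word(word, dictionary=DICTIONARY, cutoff=0.8):
    """Replace a word with its closest dictionary entry, if one is similar enough."""
    if word in dictionary:
        return word
    matches = difflib.get_close_matches(word, dictionary, n=1, cutoff=cutoff)
    # Leave the word untouched when nothing in the dictionary is close enough.
    return matches[0] if matches else word


# Typical OCR slips: 'e' read as 'c', 'i' read as 'l', plus an unknown token.
ocr_output = ["scanncr", "recognitlon", "xyz"]
corrected = [correct_word(word) for word in ocr_output]
print(corrected)
```

Words that are close to a dictionary entry are repaired, while an unknown token such as a name or abbreviation is left as it was, which is safer than forcing a match.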

Advantages:

free distribution;
use of dictionaries to verify the correctness of the text;
scanning text with poor quality photocopy.

Disadvantages:

relatively low accuracy;
a small number of supported languages.

WinScan2PDF

This is not even a full-fledged program but a utility. No installation is required, and the executable file is only a few kilobytes in size. Recognition is extremely fast, but the resulting documents are saved only in PDF format.

In effect, the whole process takes three button presses: select the source, select the destination and start the program.

The utility is designed for fast batch processing of multiple files. For users' convenience, the interface is available in a large number of languages.

Advantages:

portability;
fast work;
easy to use.

Disadvantages:

minimal functionality;
the only format of files at the output.

SimpleOCR

An excellent small program for recognizing text in images. It even supports reading handwriting. The trouble is that Russian is included neither in the interface language pack nor in the list of languages supported for recognition.

However, if you need to scan English, Danish or French, you will not find a better free option.

Within its niche, the program provides accurate font recognition, noise removal and extraction of pictures. In addition, a text editor almost identical to WordPad is built into the interface, which noticeably improves the program's usability.

Advantages:

accurate text recognition;
convenient text editor;
removing noise from the image.

Disadvantages:

complete lack of Russian support.

FreeMore OCR

The program lets you quickly extract text and graphics from images. The software can work with several scanners without loss of performance. The extracted text can be saved as a plain text file or as an MS Office document.

In addition, a multi-page recognition feature is provided.

FreeMore OCR is distributed free of charge, but its interface is available only in English. This does not affect ease of use, because the controls are laid out intuitively.

Advantages: