TH-OCR NK
content introduction go to next page background assumptions overall structure chart IPO for overall structure dataflow diagram of overall structure analysis methods structure chart modules 3.0 user interface requirement program flowchart data structure detail of a modules
After the preliminary investigation, it is the time for us to go to the kernel -- the design stage. Since the further implementations are all based on this design report. the details should be involved clearly. go to next page introduction go to previous page
Being a successful secretary in the office, good typing skills is need. However, if a big pile of document is waiting for you, OCR is you good assistant. It can help you to input all documents within a few minutes. Besides secretary, people usually need to type much document into computer. If you want to do it in a efficient way, TH-OCR-HK is you first choice. go to next page background go to previous page
go to next page assumptions We assume the users have the ability to use a scanner. Users are expected to have ability to type in Chinese or have a hand writing board. go to previous page
go to next page overall structure chart TH-OCR NK 1.0 File 2.0 Text modification 4.0 Assistance 5.0 From scanner 2.1 Word Recognition 3.0 Insert 4.1 From file 2.2 Edit 4.2 New word creation 5.2 Help 5.1 go to previous page
go to next page IPO for overall structure Processing 1. choosing language 2. modifying the text 3. inserting objects 4. using function to edit the textInput File Inputoutput file document on screen go to previous page
go to next page dataflow diagram of overall structure primary object editing existed file other text file final result file document on screen go to previous page
go to next page analysis methods TOP DOWN method has been used. Process from the upper level, objectives and operations of the information system as a whole are considered. Since the main job is not separated into many tasks, it is more efficient to manage the job by the TOP DOWN method. go to previous page
go to next page structure chart modules 3.0 (word recognition) recognize by part recognize all modify with script italics rectification layout analysis manual italics rectification manual layout analysis line order reversal learning of new word word bank modification option 3.4 word recognition 3.0 go to previous page
A graphic file is loaded into TH-OCR NK go to next page the script the script is display in a larger scale user interface go to previous page
Analysis is finished. It’s the time for editing the text. go to next page the analyzed part of the script the original script user interface go to previous page
hardware requirement : or above 4 MB RAM 2X CD ROM scanner software requirement : Windows 95/98/NT NJwin2.0 (if recognize Japanese character) go to next page requirement go to previous page
Users are expected to know how to use scanner how to input Chinese characters or Japanese character ( if recognize Japanese characters) operate Windows 95/98/NT go to next page requirement go to previous page
go to next page program flowchart and data structure for ‘word recognition’ do not provide this function token the word from the script start compared the word with the word bank file opened ? yes no data structure script = record word : integer line : integer end; go to previous page
go to next page program flowchart and data structure for ‘word recognition’ take the most similar word into object file is the key ‘Esc’ pressed? do the compare finish ? no yes finish go to previous page bank = record language : string word : integer end; object = record word : integer line : integer end;
go to next page detail of ‘word recognition’ modules go to previous page 1. module abstractThe main function of this module is to let the paper script change into a computer text file. 2. implementation documentation There are no instruction that the user should know. 3. design Check whether the script file is opened if selected, then start the word recognition else refuse to start the function
go to next page detail of ‘word recognition’ modules go to previous page 4. exports Global type declaration and constants script’s record object 5. importsno external procedure is necessary 6. input / output Input : the script file Output : the object file display the object on the screen
go to next page detail of ‘word recognition’ modules go to previous page 7. pre and post conditions The important variant on the data structure is the script record and the word bank. If the script change, the object will be changed. If the word bank change, word can not be recognize correctly. 8. error handlingError may cause by the memory jam. It will happen when the script file contain too many words. If error occurs, user will be asked to break the script file into two subfiles. 9. test caseTest this software with different size of script files and open them altogether in order to test the software memory management.
software : TH-OCR NK date : 1st. November, 1999 account no Please call us if you have any question. Tel : Fax :