Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
foundation:gsoc_geganage [2010/08/17 17:49]
lgtkaushalya
foundation:gsoc_geganage [2010/12/18 17:35] (current)
Line 127: Line 127:
 But the accuracy was not improved at that stage and most of the time Tesseract returned a segmentation fault error at the images. So then I have tried for a data set which I was written by myself.  But the accuracy was not improved at that stage and most of the time Tesseract returned a segmentation fault error at the images. So then I have tried for a data set which I was written by myself. 
  
 +{{:foundation:handwritten2.jpg?792×208|}}
      
 A portion of a sample image I have written to train the Tesseract for handwritten letter A portion of a sample image I have written to train the Tesseract for handwritten letter
Line 155: Line 155:
  
 While I have working with the system I have recognized that the SahanaOCR was unable to process the rotated images at about 5 degrees. It was only able to change image upside down and process it. But for the images which were at -5 to 5 degrees and 175 to 185 degrees rotated the system does not validate the forms with the xforms.  So I had to modify the algorithm which is used to rotate the images. Then It was able to correctly rotate the images. The following two images show the correct rotation of the images by the system.   While I have working with the system I have recognized that the SahanaOCR was unable to process the rotated images at about 5 degrees. It was only able to change image upside down and process it. But for the images which were at -5 to 5 degrees and 175 to 185 degrees rotated the system does not validate the forms with the xforms.  So I had to modify the algorithm which is used to rotate the images. Then It was able to correctly rotate the images. The following two images show the correct rotation of the images by the system.  
 +
 +
                        
 +{{:foundation:original_rotated.jpg?496x697|}}                    {{:foundation:horizontally_proceesed_image.jpg?496x697|}}
                        
-Original image which is rotated in to 175 deg   Correctly rotated image by the system+Original image which is rotated in to 175 deg and its corresponding properly rotated image by the system 
 + 
  
 In this the data filed coordinates got small deviation with the rotated images. So there were some errors with the segmented letter boxes. So we planned to handle it by applying more improved algorithm to it. I’ll list it at the todo section.  In this the data filed coordinates got small deviation with the rotated images. So there were some errors with the segmented letter boxes. So we planned to handle it by applying more improved algorithm to it. I’ll list it at the todo section. 
-Designing the UI+ 
 + 
 +== Designing the UI == 
 + 
 Then I have worked with improving the UI features. So after discussing with the mentor I have started working with combine main functionalities as  Then I have worked with improving the UI features. So after discussing with the mentor I have started working with combine main functionalities as 
 +
   * loading images by file system   * loading images by file system
   * loading xforms to the system   * loading xforms to the system
   * process the form and get results   * process the form and get results
 +
 Then after that I have design the Log Form features to show the processing criteria of the forms. The Log Form contained following features.  Then after that I have design the Log Form features to show the processing criteria of the forms. The Log Form contained following features. 
  
Line 170: Line 181:
   * Show the recognized letter corresponding to the segmented images   * Show the recognized letter corresponding to the segmented images
   * Show the full results of the form while processing   * Show the full results of the form while processing
 +
 Following screen shot shows the design of the Log Form. Following screen shot shows the design of the Log Form.
    
- Screen shot of the Log form of the UI while running a process+{{:foundation:logform_modified.jpg|{{:foundation:logform_modified.jpg|}} 
 + 
 +Screen shot of the Log form of the UI while running a process 
 + 
 Then I have started working with integrating the Scanner Manager option to the UI. That was loading images directly from the scanners. Using that we can automate the process of the form loading to the system.  Then I have started working with integrating the Scanner Manager option to the UI. That was loading images directly from the scanners. Using that we can automate the process of the form loading to the system. 
- Now the images were correctly uploaded to the system using the Scanner Manager. + 
 +Now the images were correctly uploaded to the system using the Scanner Manager.  
 + 
 After all I have identified some more functionality to add to the system so it could be more usable for the users.   After all I have identified some more functionality to add to the system so it could be more usable for the users.  
-To do + 
-  These are the features I have identified to improve the system further in the future.  + 
 +== To do ==  
 + 
 +These are the features I have identified to improve the system further in the future. 
 + 
   * To improve the accuracy of the outputs we had to correctly create a training dataset for the handwritten characters using Tesseract.   * To improve the accuracy of the outputs we had to correctly create a training dataset for the handwritten characters using Tesseract.
 +
   * To handle the rotated images we had to change the current algorithm. The current algorithm first recognize the form the 5 black boxes at the edges of the image. Then it extract the form section which is bounded by those edges and then extract the data fields , input areas and the letter boxes according to coordinates from those edges. But if there is a small deviation in a position if any data filed all the other areas inside that does not correctly get segment. So we have to change it to first extract a little bit larger area than the data filed and then recognize the edges of the data filed using image processing and then process the fields within that. It may remove these issues with the rotated images.   * To handle the rotated images we had to change the current algorithm. The current algorithm first recognize the form the 5 black boxes at the edges of the image. Then it extract the form section which is bounded by those edges and then extract the data fields , input areas and the letter boxes according to coordinates from those edges. But if there is a small deviation in a position if any data filed all the other areas inside that does not correctly get segment. So we have to change it to first extract a little bit larger area than the data filed and then recognize the edges of the data filed using image processing and then process the fields within that. It may remove these issues with the rotated images.
 +
   * Completing the NetMngr and complete the system to upload the recognized data to its corresponding module.    * Completing the NetMngr and complete the system to upload the recognized data to its corresponding module. 
-Conclusion+ 
 +== User Guide == 
 + 
 +This is the link for the user guide for the features that are provided by the existing SahanaOCR system.  
 + 
 +http://wiki.sahanafoundation.org/doku.php/wiki:user:lgtkaushalya  
 + 
 +This is the link for the video demo of the current SahanaOCR application 
 + 
 +http://www.youtube.com/watch?v=Zl3KR8QEHyI  
 + 
 +Here is the link for the progress report of the SahanaOCR project during Gsoc 2010 
 + 
 +http://www.mediafire.com/?am1aerng63ni450 
 + 
 +== Conclusion == 
 SahanaOCR is a system which came out form innovative ideas and it contains some new concept in a practical scenario. With working with the system during the project period I gained a lot of knowledge and that was a fascination era of my life. For that Gihan , Jo , Chammindra , Michel,  Hayesha and Suryagith helped me a lot and others from the community helped too. I’m willing to work with the project further more finish it as a complete project in the near future.     SahanaOCR is a system which came out form innovative ideas and it contains some new concept in a practical scenario. With working with the system during the project period I gained a lot of knowledge and that was a fascination era of my life. For that Gihan , Jo , Chammindra , Michel,  Hayesha and Suryagith helped me a lot and others from the community helped too. I’m willing to work with the project further more finish it as a complete project in the near future.    
  

QR Code
QR Code foundation:gsoc_geganage (generated for current page)