Differences

foundation:gsoc_geganage [2010/08/17 17:58] lgtkaushalya
foundation:gsoc_geganage [2010/12/18 17:35] (current)
Line 127: Line 127:
 But the accuracy did not improve at that stage, and most of the time Tesseract returned a segmentation fault on the images. So I then tried a data set that I had written myself.
  
-{{:foundation:handwritten2.jpg|}}
+{{:foundation:handwritten2.jpg?792×208|}}
      
 A portion of a sample image I wrote to train Tesseract on handwritten letters
Line 155: Line 155:
  
 While working with the system, I found that SahanaOCR was unable to process images rotated by about 5 degrees. It was only able to flip an upside-down image and process it. For images rotated by -5 to 5 degrees or by 175 to 185 degrees, the system did not validate the forms against the xforms. So I had to modify the algorithm used to rotate the images, after which the system rotated them correctly. The following two images show the correct rotation of the images by the system.
 +
 +
                        
-{{:foundation:original_rotated.jpg|}}                    {{:foundation:horizontally_proceesed_image.jpg|}}
+{{:foundation:original_rotated.jpg?496x697|}}                    {{:foundation:horizontally_proceesed_image.jpg?496x697|}}
                        
-Original image which is rotated in to 175 deg   Correctly rotated image by the system
+Original image rotated to 175 deg and the corresponding image correctly rotated by the system
 + 
  
 Here the data field coordinates deviated slightly in the rotated images, which caused some errors in the segmented letter boxes. We plan to handle this by applying an improved algorithm; I'll list it in the To do section.
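
The rotation handling and the coordinate deviation described above can be pictured with the minimal Python sketch below. It is not the SahanaOCR implementation: the function names `split_rotation` and `rotate_point`, the page centre, and the idea of correcting field coordinates by rotating them about that centre are all assumptions made for illustration. It splits a detected page angle into an upside-down flip plus a small residual skew (covering the -5 to 5 and 175 to 185 degree cases) and then maps a field corner through that residual.

```python
import math


def split_rotation(detected_deg):
    """Split a detected page angle into (upside_down, residual_skew_deg).

    Hypothetical helper: the angle is assumed to come from an earlier
    skew-detection step that is not shown here.
    """
    # Normalize to the range (-180, 180].
    a = detected_deg % 360.0
    if a > 180.0:
        a -= 360.0
    upside_down = abs(a) > 90.0
    if upside_down:
        # Remove the half turn so that only the small skew remains.
        residual = a - 180.0 if a > 0 else a + 180.0
    else:
        residual = a
    return upside_down, residual


def rotate_point(x, y, angle_deg, cx, cy):
    """Rotate (x, y) by angle_deg about (cx, cy) with the standard rotation matrix."""
    t = math.radians(angle_deg)
    dx, dy = x - cx, y - cy
    return (cx + dx * math.cos(t) - dy * math.sin(t),
            cy + dx * math.sin(t) + dy * math.cos(t))


# A page detected at 175 degrees is treated as upside down plus a small skew.
upside_down, skew = split_rotation(175)

# Map one field corner through the residual skew. The centre (248.0, 348.5)
# and the sign convention are assumptions for this example only.
corner = rotate_point(100.0, 50.0, skew, 248.0, 348.5)
```

Separating the half turn from the residual skew keeps the two cases independent: the flip is exact, while the small skew is where field coordinates can drift and letter boxes can be cut in the wrong place.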
Line 180: Line 184:
 The following screenshot shows the design of the Log Form.
    
-{{:foundation:logform_modified.jpg|{{:foundation:logform_modified.jpg|}}}}
+{{:foundation:logform_modified.jpg|{{:foundation:logform_modified.jpg|}}
  
 Screenshot of the Log Form of the UI while running a process
Line 187: Line 191:
 I then started integrating the Scanner Manager option into the UI. It loads images directly from scanners, which lets us automate the loading of forms into the system.
  
- Now the images were correctly uploaded to the system using the Scanner Manager.
+Now the images were correctly uploaded to the system using the Scanner Manager.
 + 
 Finally, I identified some more functionality to add to the system to make it more usable.
  
Line 193: Line 199:
 == To do ==
  
-  These are the features I have identified to improve the system further in the future.
+These are the features I have identified to improve the system further in the future.
 + 
   * To improve the accuracy of the outputs, we need to create a proper training dataset for handwritten characters using Tesseract.
  
Line 200: Line 208:
   * Complete the NetMngr and finish the system so that it uploads the recognized data to its corresponding module.
  
 +== User Guide ==
 +
 +This is the link to the user guide for the features provided by the existing SahanaOCR system.
 +
 +http://wiki.sahanafoundation.org/doku.php/wiki:user:lgtkaushalya 
 +
 +This is the link to the video demo of the current SahanaOCR application
 +
 +http://www.youtube.com/watch?v=Zl3KR8QEHyI 
 +
 +Here is the link to the progress report of the SahanaOCR project during GSoC 2010
 +
 +http://www.mediafire.com/?am1aerng63ni450
  
 == Conclusion ==
