QUICK REVIEW

[論文レビュー] Recognition of Non-Compound Handwritten Devnagari Characters using a Combination of MLP and Minimum Edit Distance

Sandhya Arora, Debotosh Bhattacharjee|arXiv (Cornell University)|Jun 30, 2010

Handwritten Text Recognition Techniques参考文献 15被引用数 44

ひとこと要約

本稿では、シャドウ特徴量とチェーンコードヒストグラム特徴量を用い、マルチレイヤーパーセプトロン（MLP）分類器と最小編集距離（MED）を組み合わせた2段階のオフライン筆跡非複合デーヴァナーガリ文字認識システムを提案する。本手法は、初期分類に重み付き多数決を活用し、形状が類似する文字の解消にハリスコーナー検出を用いることで、7,154サンプルのデータセットで90.74%の認識精度を達成した。

ABSTRACT

This paper deals with a new method for recognition of offline Handwritten non-compound Devnagari Characters in two stages. It uses two well known and established pattern recognition techniques: one using neural networks and the other one using minimum edit distance. Each of these techniques is applied on different sets of characters for recognition. In the first stage, two sets of features are computed and two classifiers are applied to get higher recognition accuracy. Two MLP's are used separately to recognize the characters. For one of the MLP's the characters are represented with their shadow features and for the other chain code histogram feature is used. The decision of both MLP's is combined using weighted majority scheme. Top three results produced by combined MLP's in the first stage are used to calculate the relative difference values. In the second stage, based on these relative differences character set is divided into two. First set consists of the characters with distinct shapes and second set consists of confused characters, which appear very similar in shapes. Characters of distinct shapes of first set are classified using MLP. Confused characters in second set are classified using minimum edit distance method. Method of minimum edit distance makes use of corner detected in a character image using modified Harris corner detection technique. Experiment on this method is carried out on a database of 7154 samples. The overall recognition is found to be 90.74%.

研究の動機と目的

形状のばらつきや視覚的類似性のため、高精度で手書き非複合デーヴァナーガリ文字を認識する課題に対処すること。
視覚的に類似する文字の誤分類を減らすために、それらを別々の処理パスに分離すること。
特徴の多様性と重み付き多数決を用いた分類器統合により、認識性能を向上させること。
幾何学的および構造的特徴を用いて、混乱しやすい文字ペairを区別する堅牢な手法を開発すること。
ニューラルネットワークと文字列編集ベース分類戦略を組み合わせることで、全体の認識精度を高めること。

提案手法

入力文字画像からシャドウ特徴量とチェーンコードヒストグラム特徴量を抽出し、2つのMLP分類に用いる。
シャドウ特徴量を用いたMLPと、チェーンコードヒストグラム特徴量を用いたMLPを別々に訓練する。
2つのMLPの意思決定を重み付き多数決スキームを用いて統合し、上位3つの候補文字を出力する。
上位3つの候補の相対差分値を計算し、文字を「形状が明確な文字」と「混乱しやすい文字」の2グループに分類する。
混乱しやすい文字セットに対して、修正版ハリスコーナー検出アルゴリズムで検出されたコーナー点を用いて最小編集距離（MED）を適用する。
コーナー点の順序に基づくMEDを用い、構造的類似度を測定し、最も類似した参照文字に最終ラベルを割り当てる。

実験結果

リサーチクエスチョン

RQ1MLPと最小編集距離を組み合わせたハイブリッドアプローチは、手書きデーヴァナーガリ文字の認識精度を向上させることができるか？
RQ2シャドウ特徴量とチェーンコードヒストグラム特徴量を重み付き多数決と併用することで、初期分類にどの程度有効であるか？
RQ3コーナーに基づく表現を用いた最小編集距離は、視覚的に類似するデーヴァナーガリ文字間の混乱をどの程度解消できるか？
RQ4文字を「形状が明確な文字」と「混乱しやすい文字」に分けることで、全体の認識性能が向上するか？
RQ5特徴の多様性と分類器統合は、非複合デーヴァナーガリ文字の認識精度にどのような影響を与えるか？

主な発見

提案された2段階手法は、7,154サンプルの手書きデーヴァナーガリ文字サンプルデータセットで全体で90.74%の認識精度を達成した。
シャドウ特徴量とチェーンコードヒストグラム特徴量を用いた2つのMLPの併用により、特徴レベルの多様性によって初期分類の信頼性が向上した。
重み付き多数決スキームにより、2つの異なる特徴セットからの意思決定を集約することで、誤分類が効果的に低減された。
コーナー検出を用いた最小編集距離アプローチにより、視覚的に類似する文字ペアの認識が顕著に向上した。
文字を「形状が明確な文字」と「混乱しやすい文字」に分けることで、分類戦略を最適化でき、全体のシステムの頑健性が向上した。
修正版ハリスコーナー検出技術により、曖昧な文字のMEDベース分類に不可欠な構造的特徴が効果的に抽出された。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。