問(wèn)題描述
背景
對(duì)于我在大學(xué)的最后一個(gè)項(xiàng)目,我正在開發(fā)一個(gè)車牌檢測(cè)應(yīng)用程序.我認(rèn)為自己是一名中級(jí)程序員,但是我的數(shù)學(xué)知識(shí)缺乏中學(xué)以上的任何知識(shí),這使得生成正確的公式比應(yīng)該做的更難.
我花了很多時(shí)間查找學(xué)術(shù)論文,例如:
- 解決方案
您可以采取多種方法,但首先想到的策略是:
- 發(fā)現(xiàn)/研究:確定您可能需要識(shí)別的一組顏色和字體.如果您的樣本圖片代表了大多數(shù)英國(guó)車牌,那么您的工作就會(huì)變得更容易.例如.簡(jiǎn)單、單一的字體和白色背景上的黑色字體
- 代碼:嘗試識(shí)別圖像的矩形區(qū)域,其中顏色主要是白色和黑色.這不是一個(gè)非常繁重的數(shù)學(xué)問(wèn)題,它應(yīng)該讓您專注于車牌區(qū)域.
- 代碼:對(duì)您的子區(qū)域進(jìn)行一些清理,例如將其轉(zhuǎn)換為純黑白(單色),并可能縮放/移動(dòng)成一個(gè)漂亮、緊湊的矩形.
- 使用 API:接下來(lái)在您的子選擇圖像區(qū)域上使用現(xiàn)有的 OCR(光學(xué)字符識(shí)別)算法,以便查看您是否可以閱讀文本.
就像我說(shuō)的那樣,這是許多策略中的一種,但它被認(rèn)為是一種需要最少大量數(shù)學(xué)運(yùn)算的策略……也就是說(shuō),如果您能找到適合您的 OCR 實(shí)現(xiàn).
Background
For my final project at university, I'm developing a vehicle license plate detection application. I consider myself an intermediate programmer, however my mathematics knowledge lacks anything above secondary school, which makes producing the right formulas harder than it probably should be.
I've spend a good amount of time looking up academic papers such as:
- Detecting Vehicle License Plates in Images
- Robust License Plate Detection using Image Saliency
- Local Enhancement of Car Image for License Plate Detection
When it comes to the math, I'm lost. Due to this testing various graphic images proved productive, for example:
to
However this approach only worked to that particular image, and if the techniques were applied to different images, I'm sure a poorer conversion would occur. I've read about a formula called the "bottom hat morphology transform", which does the following:
Basically, the trans- formation keeps all the dark details of the picture, and eliminates everything else (including bigger dark regions and light regions).
I can't find much information on this, however the image within the documentation near the end of the report shows its effectiveness.
Other constraints
- Developing in C#
- Confining the project to UK registration plates only
- I can choose the images to convert as a demonstration
Question
I need advice on what transformation techniques I should focus on developing, and what algorithms can help me.
EDIT: New information present on Continued - Vehicle License Plate Detection
解決方案There are a number of approaches you can take but the first strategy that pops into mind is to:
- Discovery/research: Identify the set of colors and fonts that you may need to identify. If your sample picture is representative of most British plates then your job is made easier. E.g. Simple, singular font and black lettering on a white background
- Code: Attempt to identify a rectangular region of an image where the colors are predominantly white and black. This is not a terribly math-heavy problem and it should give you the license plate region to concentrate on.
- Code: Do some clean up on your subregion such conversion to pure black and white (monochrome) and perhaps scaling/shifting into a nice, tight rectangle.
- Use API: Next employ an existing OCR (optical character recognition) algorithm on your sub-selected image region so see if you can read the text.
Like I said, this is one strategy of many but it comes to mind as one requiring the least amount of heavy math... that is if you can find an OCR implementation that will work for you.
這篇關(guān)于車牌檢測(cè)有哪些好的算法?的文章就介紹到這了,希望我們推薦的答案對(duì)大家有所幫助,也希望大家多多支持html5模板網(wǎng)!
【網(wǎng)站聲明】本站部分內(nèi)容來(lái)源于互聯(lián)網(wǎng),旨在幫助大家更快的解決問(wèn)題,如果有圖片或者內(nèi)容侵犯了您的權(quán)益,請(qǐng)聯(lián)系我們刪除處理,感謝您的支持!