• Nem Talált Eredményt

Pattern Recognition

N/A
N/A
Protected

Academic year: 2022

Ossza meg "Pattern Recognition"

Copied!
68
0
0

Teljes szövegt

(1)

Pattern Recognition

PhD Course

(2)

Automatic Letter Recognition

Steps for the letter recognition:

1. Creating a training set:

- Separating characters from text

- Creating the feature vector for the separated character 2. Identify the unidentified characters using the training set

(3)

the training set

?

(4)

Pattern Recognition

X p-dimesional random vector, the feature vector

Y discrete variable, the classification, RY={1,2,…,M}

g decision function

If g then the decision makes error

(5)

In the formulation of the Bayes decision problem, introduce a cost function ) which is the cost if the label Y = y and the decision g() = y’ .

For a decision function g, the risk is the expectation of the cost: C(y),g()

In Bayes decision problem, the aim is to minimize the risk, i.e., the goal is to find a function

:{1,2, … , } such that

()= min

:{1,2,…}

(�)

where is called the Bayes decision function, and is the Bayes risk

(6)

For the posteriori probabilities, introduce the notations:

(

)

=�(Y= y) Let the decision function be defined by

,

=1

(¿) ( )

( )= arg min

¿

If arg min is not unique then choose the smallest y’ , which minimizes the sum.

This definition implies that for any decision function g,

,

,

(¿ ( ) ) ( )

(¿ ( ) ) ( )

=1

¿

=1

¿

(7)

Theorem For any decision function g, we have that R(

Proof. For a decision function g, let’s calculate the risk.

,

=1

(¿) { = , ( ) = }

=1

¿

¿ ¿

¿ {

=1

( , ( ) )

( ) }

This implies that

( ) =� {

=1

( , ( ) )

( ) } {

=1

( ,

( ) )

( ) } = R (

)

(8)

Concerning the cost function, the most frequently studied example is the so called 0 − 1 loss:

( ,

) = { 0 1 �� � �� � =

For the 0 − 1 loss, the corresponding risk is the error probability:

, and the Bayes decision is of form

( ) = argmin

( ) =argmax

( )

which is called maximum posteriori decision, too.

(9)

If the distribution of the observation vector has density, then the Bayes decision has an equivalent formulation. Introduce the notations for density of by

{

� � �

}

=

(

)

� �

and for the conditional densities by

{

� � �∨�=

}

=

(

)

� �

and for a priori probabilities =

{

=

}

then it is easy to check that

{

==

}

= � �

()

( )

(10)

and therefore

From the proof of Theorem we may derive a formula for the optimal risk:

(11)

If has density then

For the 0 − 1 loss, we get that

which has the form, for densities,

(12)

Multivariate Normal Distribution

(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
(26)
(27)
(28)
(29)
(30)
(31)
(32)
(33)
(34)
(35)
(36)
(37)
(38)

Linear Combinations

(39)

MVN Properties

(40)
(41)
(42)
(43)
(44)
(45)
(46)
(47)
(48)

Discriminant Analysis (DA)

(49)
(50)
(51)
(52)

That is, in multivariate normal case, we can reach the minimal risk!

(53)
(54)
(55)
(56)
(57)
(58)
(59)
(60)
(61)
(62)
(63)
(64)
(65)
(66)
(67)
(68)

Wilks' Lambda Test

Wilks' Lambda test is to test which variable contribute significance in

discriminant function. The closer Wilks' lambda is to 0, the more the variable contributes to the discriminant function. The table also provide a Chi-Square statsitic to test the significance of Wilk's Lambda. If the e-value if less than 0.05, we can conclude that the corresponding function explain the group membership well.

A goodness-of-fit parameter, Wilks’ lambda, is defined as follows:

where λj is the jth eigenvalue corresponding to the eigenvector described above and m is the minimum of C-1 and p.

Hivatkozások

KAPCSOLÓDÓ DOKUMENTUMOK

Although the preconditions of Chi-square test are not satisfied (Table 12): 43.8% of cells have expected count less than 5, therefore the related null-hypothesis can be not

If we regard an ODE as a function which orders value of steepness to the points of the place then the point serial giving the solution can be written by the help of vector

According to the present results we conclude that although the developmental pattern is clear in some aspects of children’s dreams, we also found that even preschoolers are able

As shown in Table 8 above, using the Pearson Chi- square, it could be observed that the significance value is 0.087 which is greater than the p-value = 0.05, hence we accept

For the determination of a single ERR value seyeral deter- minati()ns haye to be carried out with sample" of idcntical moisture content, at identical

If at thread k , an ONU requests less than the minimum guaranteed bandwidth, at the next polling cycle of the corresponding thread, the ONU is granted the requested bandwidth by

Colour is both a technical and an artistic tool for designers of coloured environment. Unambiguous distinction by codes is required in the first case~ to assign

If in the expression of <Pll(s) the transfer function Wc(s) is replaced by the physically realizable optimum transfer function Wcm(s) and the mean square