275x Filetype PDF File size 0.84 MB Source: legge.psych.umn.edu
Journal of Vision (2018) 18(1):1, 1–13 1
Comparing the minimum spatial-frequency content for
recognizing Chinese and alphabet characters
Department of Biomedical Engineering,
University of Minnesota, Minneapolis, MN, USA
Present address: Athinoula A. Martinos Center for
Biomedical Imaging, Department of Radiology,
Massachusetts General Hospital and Harvard Medical
Hui Wang School, Charlestown, MA, USA $
Department of Psychology University of Minnesota,
Gordon E. Legge Minneapolis, MN, USA $
Visual blur is a common problem that causes difficulty in Introduction
pattern recognition for normally sighted people under
degraded viewing conditions (e.g., near the acuity limit,
when defocused, or in fog) and also for people with Character recognition is a prerequisite for reading
impaired vision. For reliable identification, the spatial and is typically a fast and accurate visual process. It
frequency content of an object needs to extend up to or becomes difficult under degraded visual conditions,
exceed a minimum value in units of cycles per object, suchasreadingsmallsymbolsatalongdistanceorwith
referred to as the critical spatial frequency. In this study, optical defocus, and is especially difficult in patients
we investigated the critical spatial frequency for with severe low vision. The spatial-frequency properties
alphabet and Chinese characters, and examined the of letter recognition have been widely explored.
effect of pattern complexity. The stimuli were divided Previous studies show that the visual system utilizes a
into seven categories based on their perimetric spatial frequency of 1–3 cycles per letter (CPL) for
complexity, including the lowercase and uppercase
alphabet letters, and five groups of Chinese characters. reliable identification (Alexander, Xie, & Derlacki,
Wefound that the critical spatial frequency significantly 1994; Chung, Legge, & Tjan, 2002; Ginsburg, 1978;
increased with complexity, from 1.01 cycles per Gold, Bennett, & Sekuler, 1999; Legge, Pelli, Rubin, &
character for the simplest group to 2.00 cycles per Schleske, 1985; Parish & Sperling, 1991; Solomon &
character for the most complex group of Chinese Pelli, 1994), with the optimal spatial frequency de-
characters. A second goal of the study was to test a pending somewhat on the angular size of letters (Majaj,
space-bandwidth invariance hypothesis that would Pelli, Kurshan, & Palomares, 2002). Kwon and Legge
represent a tradeoff between the critical spatial (2011) reported that accurate letter identification is
frequency and the number of adjacent patterns that can possible with letters containing spatial frequencies only
be recognized at one time. We tested this hypothesis by upto0.9CPL.Theseauthorsappliedlowpassfiltersto
comparing the critical spatial frequencies in cycles per images of letters and faces and obtained psychometric
character from the current study and visual-span sizes in functions showing recognition performance (percent
number of characters (measured by Wang, He, & Legge, correct) as a function of the cutoff frequency of the
2014) for sets of characters with different complexities. filters. They referred to the minimal spatial-frequency
For the character size (1.28) we used in the study, we requirement for pattern recognition (with 80% accura-
found an invariant product of approximately 10 cycles, cy) as the critical spatial frequency.
which may represent a capacity limitation on visual Chinese characters differ from alphabetic characters
pattern recognition.
in having a wider range of pattern complexities.
Studying Chinese character recognition may elucidate
the connection between pattern recognition and pattern
complexity. The goal of our study was to determine the
critical-frequency requirements for Chinese characters,
and to examine the effect of pattern complexity.
Citation: Wang, H. & Legge, G. E. (2018). Comparing the minimum spatial-frequency content for recognizing Chinese and
alphabet characters. Journal of Vision, 18(1):1, 1–13, https://doi.org/10.1167/18.1.1.
https://doi.org/10.1167/18.1.1 Received April 11, 2017; published January 2, 2018 ISSN 1534-7362 Copyright 2018 The Authors
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Downloaded From: https://jov.arvojournals.org/pdfaccess.ashx?url=/data/journals/jov/936669/ on 01/07/2019
Journal of Vision (2018) 18(1):1, 1–13 Wang & Legge 2
Critical cutoff frequencies can be expressed in both complexity increases (Wang et al., 2014). If critical
retinal spatial frequency (cycles per degree) or image- frequencies are found to increase with complexity, it is
based spatial-frequency (cycles per character; CPC). In possible that the product of critical frequency and
this paper, we will usually refer to spatial frequencies visual-span size may be constant, representing a form
(including cutoff frequencies) in cycles per character. of capacity limitation on visual pattern recognition. In
Anexception will be our consideration of the effects of the context of this paper, we refer to the bandwidth of
the contrast sensitivity function (CSF) in the Discus- the low-pass filter as the range from zero to the critical
sion. frequency. For simplicity, we used the term bandwidth
Previous studies have shown that the acuity limit for instead of the critical frequency in our hypothesis.
recognizing Chinese characters with more strokes The study of character recognition has important
requires larger size (Cai, Chi, & You, 2001; Chi, Cai, & practical implications for reading performance. It is
You, 2003; Huang & Hsu, 2005). Chinese characters known that a critical frequency is required for
with more strokes also have higher contrast thresholds uncompromised reading speed in alphabet reading
(Yen & Liu, 1972) and longer response times (Yu & (Kwon&Legge,2012).Therefore,studying the spatial-
Cao, 1992). However, reports on the spatial frequency frequency requirements for Chinese characters may be
properties of Chinese character recognition are scarce. relevant to Chinese reading under low-resolution
Chen, Yeh, and Lin (2001) adopted the critical-band– conditions including low vision. It may also have
maskingparadigmusedbySolomonandPelli(1994)to practical applications in designing reading material for
investigate the best central frequencies for Chinese difficult viewing conditions.
characters. They tested Chinese characters with 3 to 21
strokes, and reported an average spatial frequency of
approximately 8 CPC. The study however, did not take Methods
the variation of complexities into account, and did not
investigate the minimal spatial-frequency requirements
for Chinese character recognition. Subjects
In this study, we explored the critical spatial-
frequency requirements for alphabet and Chinese Six college students (three men, three women) with
characters, and examined the effect of complexity on normal or corrected-to-normal vision participated in
these requirements. As the more complex characters the experiments. They were all native Chinese speakers,
have broader spatial-frequency spectra than the simple originally educated in the simplified Chinese script
characters, they may require higher spatial frequency system, and all had more than 10 years education in
for character recognition. We divided alphabet char- English. The subjects signed an Internal Review Board
acters and Chinese characters into categories, based on (IRB) approved consent form before the experiments.
ranges of complexity values, using the perimetric
complexity metric (Arnoult & Attneave, 1956; Pelli,
Burns, Farell, & Moore-Page, 2006). The perimetric Stimulus sets
complexity of a symbol is defined as its perimeter
squared divided by its ink area. We showed The stimulus characters were lowercase (LL) and
previously (Wang et al., 2014) that the perimetric uppercase (UL) alphabet letters in the Arial font, and
complexity metric has high correlation with other simplified Chinese characters in the Heiti font in which
complexity metrics, such as the number of strokes, the all the strokes have the same width.
stroke frequency (Majaj et al., 2002; Zhang, Zhang, The 700 most frequently used Chinese characters
Xue, Liu, & Yu, 2007) and the skeleton method (State Language Work Committee, 1992) were divided
(Bernard & Chung, 2011). For each complexity into five nonoverlapping groups based on their
category, we measured recognition performance for perimetric complexity values (Pelli et al., 2006).
sets of 26 characters as a function of the cutoff Twenty-six characters whose complexity values were
frequency of low-pass filters. close to the mean of the group were selected to form
Asecond goal of this study was to test an empirical five sets of symbols (C1–C5). Characters with very high
hypothesis of a tradeoff between the critical frequency or low similarity were excluded from the stimulus sets.
for character recognition and the visual span for Ameasure of similarity for the characters in each set
character recognition; we term this the space-band- was computed using a normalized Euclidean distance
width invariance hypothesis. The visual span is the method (Wang et al., 2014).
number of characters that can be recognized without To determine whether subjects familiarity with the
moving the eyes. We have examined the size of the characters affected their performance, we included a
visual span for alphabet letters and Chinese characters, groupofChinesecharacterswithlowerusagefrequency
and discovered that the visual span size decreases as in text but comparable in complexity with characters in
Downloaded From: https://jov.arvojournals.org/pdfaccess.ashx?url=/data/journals/jov/936669/ on 01/07/2019
Journal of Vision (2018) 18(1):1, 1–13 Wang & Legge 3
f ¼ 1 ð1Þ
1þ r 2n
c
where r is the radius of the components in the
frequency domain, c is the radius of the cutoff
frequency, and n is the order of the filter. Figure 2A
demonstrates the response function of the low-pass
filter in the spatial-frequency domain.
To test the recognition accuracy as a function of
blurring levels, six cutoff frequencies were selected for
each stimulus set while character size remained
constant. A demonstration of the characters with and
without low-pass filtering is shown in Figure 2. The sets
of filter cutoffs used for the eight complexity groups
were chosen based on recognition performance in pilot
Figure 1. Representative characters from the eight stimulus sets runs. We ensured that the cutoffs were selected so that
(LL, UL, C1–C5, and C30). The complexity gradually increases in recognition accuracy spanned a wide range, and the
the first seven rows (from LL to C5). The bottom row (C30) psychometric function exhibited a clear transition from
shows a group with comparable complexity to C3, but lower low to high performance accuracy. The cutoffs used for
familiarity. each stimulus set are summarized in Table 2.
the group C3. We did this by identifying the next 700
most frequent Chinese characters and divided them Image display
into five complexity groups as well, based on the same
complexity metric. Twenty-six characters were selected The stimuli were displayed on a 19 in. CRT monitor
to comprise a comparison group (C30), which had (refresh rate: 75 Hz, resolution: 1280 3 960). The
comparable complexity with C3 but lower frequency luminance of the blurred images on the screen was
and presumably lower familiarity. The pattern com- mapped onto 256 gray levels. The background of the
plexity in the 1,400 most frequently used characters image was set to the gray level 127, corresponding to a
2
covers most of the complexity range across all meanluminance of 40 cd/m . Luminance of the display
simplified Chinese characters. Remaining characters monitor was made linear using an 8-bit lookup table in
with even higher complexities are rarely used in conjunction with photometric readings from a Konica
ordinary reading. Five representative characters from Minolta CS-100 Chroma Meter (Konica Minolta
each stimulus set are shown in Figure 1. Statistics of the Sensing Americas, Inc., Ramsey, NJ). The image
perimetric complexity values for each stimulus set are luminance values were mapped onto the values stored
given in Table 1. in the lookup table for the display. The character image
was displayed at the center of the screen. The stimulus
symbol was created and controlled using MATLAB
Low-pass filtering (MathWorks, Natick, MA) and Psychophysics Tool-
box extensions (Brainard, 1997; Pelli, 1997; Kleiner et
al., 2007), running on a Mac Pro computer (Apple,
Ablack character was generated on a gray back- Cupertino, CA).
ground and stored as a grayscale image. The size of the
image was 2503250 pixels, and the size of the
characters (height of Chinese characters and x-height of Procedure
alphabet letters) subtended 1.28 visual angle at a
viewing distance of 40 cm. The image was blurred Each subject participated in three test sessions on
through a third order Butterworth low-pass filter (f) three days. One session consisted of eight blocks: seven
given by the following equation: blocks with varied complexity levels (LL, UL, C1–C5),
Group LL UL C1 C2 C3 C4 C5 C30
Complexity mean (SD) 48.6 (11.7) 66.5 (17.9) 98.0 (6.3) 136.9 (2.3) 176.6 (4.3) 216.2 (5.0) 280.1 (33.7) 182.0 (5.2)
Table 1. Perimetric complexity measures for the stimulus sets. Note: LL, lowercase letter; UL, uppercase letter; C1–C5, five sets of
Chinese characters from the simplest to the most complex; C30, Chinese character group of comparable complexity with C3 but less
familiarity.
Downloaded From: https://jov.arvojournals.org/pdfaccess.ashx?url=/data/journals/jov/936669/ on 01/07/2019
Journal of Vision (2018) 18(1):1, 1–13 Wang & Legge 4
Figure 2. (A) The response function of the third-order Butterworth filter in the spatial frequency domain.The arrow indicates a cutoff
frequency of 1.5 cycles per character (CPC) for a 18 letter size. The filter’s cutoff is defined as the frequency at half amplitude. (B)
Demonstration of low-pass filtered Chinese characters from the five complexity categories. The right column shows the unfiltered
character.
and one block with complexity equivalent to C3 but center of the screen. In each trial, a character was
lower character familiarity (C30). In each block, there presented for 200 ms at fixation. After that, the display
were 25 trials for each of six cutoffs forming a total of 2,
became uniform at the background level of 40 cd/m
150 trials. The stimulus symbol was randomly selected and the subject was asked to report the character. The
from the 26-character set, and the order of the cutoff experimenter recorded the responses, and the subject
frequencies presented was shuffled. The resulting clicked the mouse to start the next trial. A reference
psychometric functions for a given complexity category page was available, showing the 26 symbols in the
were therefore based on 450 trials (six cutoff frequen- current category, if the subject had trouble recalling the
cies and 75 trials per cutoff frequency). The orders of characters in the set. Subjects rarely responded with
the blocks were counterbalanced between sessions and characters outside of the stimulus category (,1% of
subjects. trials.) The 26 unfiltered characters were tested at the
The subject was shown the 26 unfiltered symbols on end of every block in order to evaluate the baseline
a hard copy page before the start of a block and urged performance for recognition. Performance on the
to restrict responses to the stimulus set. During test unfiltered stimuli was at the ceiling value of 100%.
trials, the subject was directed to fixate on a cross at the Achin rest was used during the test to reduce head
movements and to maintain the viewing distance.
Practice trials, including all the stimulus sets and the
Group f1 f2 f3 f4 f5 f6 filter cutoffs, were provided at the beginning of the test.
LL 0.78 1.02 1.27 1.49 1.80 2.16
UL 0.78 1.02 1.27 1.49 1.80 2.16 Data analysis
C1 0.78 1.02 1.27 1.49 1.80 2.16
C2 0.92 1.18 1.42 1.63 1.94 2.34 The character recognition accuracy was plotted
C3/C30 1.08 1.32 1.57 1.79 2.1 2.52 against the cutoff frequencies for each stimulus set.
C4 1.24 1.44 1.73 1.94 2.28 2.66 Cumulative Gaussian functions (Wichmann & Hill,
C5 1.30 1.54 1.87 2.09 2.46 2.82 2001)wereusedtofittheplotswiththeleast-square
Table 2. Butterworth filter cutoff frequencies (in cycles per criterion. The critical spatial frequency was estimated
character; CPC) used for recognition tests with the seven from the psychometric function, and defined as the
complexity categories. Note: LL, lowercase letter; UL, uppercase cutoff frequency yielding 80% correct responses. It is
letter; C1–C5, five sets of Chinese characters from the simplest noted that the guessing level of the psychometric
to the most complex; C30, Chinese character group of functionsis1/26¼3.85% for all the groups, because
comparable complexity with C3 but less familiarity. there are 26 stimuli in each complexity set. Figure 3
Downloaded From: https://jov.arvojournals.org/pdfaccess.ashx?url=/data/journals/jov/936669/ on 01/07/2019
no reviews yet
Please Login to review.