Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Reading LevelReading Level
-
Content TypeContent Type
-
YearFrom:-To:
-
More FiltersMore FiltersItem TypeIs Full-Text AvailableSubjectPublisherSourceDonorLanguagePlace of PublicationContributorsLocation
Done
Filters
Reset
13,880
result(s) for
"Second language tests"
Sort by:
Introducing LexTALE: A quick and valid Lexical Test for Advanced Learners of English
2012
The increasing number of experimental studies on second language (L2) processing, frequently with English as the L2, calls for a practical and valid measure of English vocabulary knowledge and proficiency. In a large-scale study with Dutch and Korean speakers of L2 English, we tested whether LexTALE, a 5-min vocabulary test, is a valid predictor of English vocabulary knowledge and, possibly, even of general English proficiency. Furthermore, the validity of LexTALE was compared with that of self-ratings of proficiency, a measure frequently used by L2 researchers. The results showed the following in both speaker groups: (1) LexTALE was a good predictor of English vocabulary knowledge; 2) it also correlated substantially with a measure of general English proficiency; and 3) LexTALE was generally superior to self-ratings in its predictions. LexTALE, but not self-ratings, also correlated highly with previous experimental data on two word recognition paradigms. The test can be carried out on or downloaded from
www.lextale.com
.
Journal Article
Raters' L2 background as a potential source of bias in rating oral performance
2013
Based on evidence that listeners may favor certain foreign accents over others and that language-test raters may better comprehend and/or rate the speech of test takers whose native languages (L1s) are more familiar on some level, the authors investigated whether accent familiarity (defined as having learned the test takers' L1) leads to rater bias. They examined 107 raters' ratings on 432 TOEFL iBTTM speech samples from 72 test takers. The raters of interest were L2 speakers of Spanish, Chinese, or Korean, while the test takers comprised three native-speaker groups (24 each) of Spanish, Chinese, and Korean. The authors analyzed the ratings using a multifaceted Rasch measurement approach. Results indicated that L2 Spanish raters were significantly more lenient with L1 Spanish test takers, as were L2 Chinese raters with L1 Chinese test takers. They conclude that rater training should address raters' linguistic background as a potential rater effect. Furthermore, the importance of recognizing rater L2 as a possible source of bias is being discussed. (Verlag, adapt.).
Journal Article
Measuring authorial voice strength in L2 argumentative writing: The development and validation of an analytic rubric
2013
Although a key concept in various writing textbooks, learning standards, and writing rubrics, voice remains a construct that is only loosely defined in the literature and impressionistically assessed in practice. Few attempts have been made to formally investigate whether and how the strength of an author's voice in written texts can be reliably measured. Using a mixed-method approach, this study develops and validates an analytic rubric that measures voice strength in second language (L2) argumentative writing. Factor analysis of ratings from six raters on voice strength in a total of 400 TOEFL® iBT writing samples, together with qualitative analysis of four raters' in-depth think-aloud and interview data, points to an alternative conceptualization of voice that sees authorial voice in written discourse as being realized primarily through the following dimensions: (1) the presence and clarity of ideas in the content; (2) the manner of the presentation of ideas; and (3) the writer and reader presence. Implications of such results for L2 writing instruction and assessment are discussed. (Verlag).
Journal Article
A comparison of video- and audio-mediated listening tests with many-facet Rasch modeling and differential distractor functioning
2015
The rise in the affordability of quality video production equipment has resulted in increased interest in video-mediated tests of foreign language listening comprehension. Although research on such tests has continued fairly steadily since the early 1980s, studies have relied on analyses of raw scores, despite the growing prevalence of item response theory in the field of language testing as a whole. The present study addresses this gap by comparing data from identical, counter-balanced multiple-choice listening test forms employing three text types (monologue, conversation, and lecture) administered to 164 university students of English in Japan. Data were analyzed via many-facet Rasch modeling to compare the difficulties of the audio and video formats; to investigate interactions between format and text-type, and format and proficiency level; and to identify specific items biased toward one or the other format. Finally, items displaying such differences were subjected to differential distractor functioning analyses. No interactions between format and text-type, or format and proficiency level, were observed. Four items were discovered displaying format-based differences in difficulty, two of which were found to correspond to possible acting anomalies in the videos. The author argues for further work focusing on item-level interactions with test format. (Verlag).
Journal Article
Predicting the proficiency level of language learners using lexical indices
by
Crossley, Scott A
,
McNamara, Danielle S
,
Salsbury, Tom
in
Analyse
,
Case Studies
,
Classification
2012
This study explores how second language (L2) texts written by learners at various proficiency levels can be classified using computational indices that characterize lexical competence. For this study, 100 writing samples taken from 100 L2 learners were analyzed using lexical indices reported by the computational tool Coh-Metrix. The L2 writing samples were categorized into beginning, intermediate, and advanced groupings based on the TOEFL and ACT ESL Compass scores of the writer. A discriminant function analysis was used to predict the level categorization of the texts using lexical indices related to breadth of lexical knowledge (word frequency, lexical diversity), depth of lexical knowledge (hypernymy, polysemy, semantic co-referentiality, and word meaningfulness), and access to core lexical items (word concreteness, familiarity, and imagability). The strongest predictors of an individual's proficiency level were word imagability, word frequency, lexical diversity, and word familiarity. In total, the indices correctly classified 70% of the texts based on proficiency level in both a training and a test set. The authors argue for the applicability of a statistical model as a method to investigate lexical competence across language levels, as a method to assess L2 lexical development, and as a method to classify L2 proficiency. (Verlag).
Journal Article
Effects of L1 definitions and cognate status of test items on the Vocabulary Size Test
2013
This study examines the development and evaluation of a bilingual Vocabulary Size Test (VST, Nation, 2006). A bilingual (English-Russian) test was developed and administered to 121 intermediate proficiency EFL learners (native speakers of Russian), alongside the original monolingual (English-only) version of the test. A comparison of the bilingual and monolingual test scores showed that participants achieved significantly higher scores on the bilingual version of the test. Accuracy of responses to individual test items was reliably higher when the meanings of test items were presented in the L1 (Russian) and when these items were cognates. The findings also revealed that the bilingual version is likely to be a more sensitive measure of written receptive vocabulary knowledge. Finally, analyses showed that the effect of using L1 for multiple-choice options is likely to be larger for low-proficiency learners and that the difference in response accuracy to cognates and non-cognates decreases as item frequency increases. The paper concludes with recommendations on developing and using bilingual vocabulary size tests. (Verlag).
Journal Article
At the interface between language testing and second language acquisition: Language ability and context of learning
2014
This study investigated the relationship between latent components of academic English language ability and test takers' study-abroad and classroom learning experiences through a structural equation modeling approach in the context of TOEFL iBT® testing. Data from the TOEFL iBT public dataset were used. The results showed that test takers' performance on the test's four skill sections, namely listening, reading, writing, and speaking, could be accounted for by two correlated latent components: the ability to listen, read, and write, and the ability to speak English. This two-factor model held equivalently across two groups of test takers, with one group having been exposed to an English-speaking environment and the other without such experience. Imposing a mean structure on the factor model led to the finding that the groups did not differ in terms of their standings on the factor means. The relationship between learning contexts and the latent ability components was further examined in structural regression models. The results of this study suggested an alternative characterization of the ability construct of the TOEFL test-taking population, and supported the comparability of the language ability developed in the home-country and the study-abroad groups. The results also shed light on the impact of studying abroad and home-country learning on language ability development. (Verlag).
Journal Article
An investigation of the use of TOEFL(TM) Junior(TM) Standard scores for ESL placement decisions in secondary education
2014
A common use of language tests is to support decisions about examinees such as placement into appropriate classes. Research on placement testing has focused on English for Academic Purposes (EAP) in higher education contexts. However, there is little research exploring the use of language tests to place students in English as a Second Language (ESL) support classes in secondary education. The present study examined the relationship between secondary school students' language test scores from a standardized English-language test and the placement of these students into ESL classes by their language teachers. Ninety-two ESL students in two English-medium schools took TOEFL(R) Junior(TM) Standard. For the same students, data collection also included teachers' judgments regarding the ESL classes the students should attend. Strong correlations between test scores and the teacher-assigned ESL levels were found. Moreover, the results from the logistic regression analysis indicated a great degree of overlap between the teacher-assigned ESL levels and the levels predicted from the TOEFL Junior Standard scores. The findings of this study provide some preliminary evidence to support the use of TOEFL Junior Standard as an initial screening tool for ESL placement. The limitations and implications of these findings for ESL placement decisions in secondary education are also discussed. (Verlag).
Journal Article
Measuring Syntactic Complexity in L2 Writing Using Fine-Grained Clausal and Phrasal Indices
2018
Syntactic complexity is an important measure of second language (L2) writing proficiency. Large-grained indices such as the mean length of T-unit (MLTU) have been used with the most consistency in L2 writing studies. Recently, indices such as MLTU have been criticized, both for the difficulty in interpretation and for a potentially misplaced focus on clausal subordination. In this article, the authors attempt to address both of these criticisms by using traditional indices of syntactic complexity (e.g., MLTU), fine-grained indices of clausal complexity, and fine-grained indices of phrasal complexity to predict holistic scores of writing quality. In 4 studies, they used indices of each index type to predict holistic writing quality scores in independent essays on the Test of English as a Foreign Language (TOEFL). They then used all index types in a combined analysis to predict a holistic writing score. The results indicated that fine-grained indices of phrasal complexity were better predictors of writing quality than either traditional or fine-grained clausal indices, though a single fine-grained index of clausal complexity contributed to the combined model. These results provide some support for Biber et al.'s (2011) claims regarding complexity and academic L2 writing proficiency. (Verlag, adapt.).
Journal Article