bridging formal and informal learning for second language writing in flax
TRANSCRIPT
Bridging Formal and Informal Learning for Second Language Writing in FLAX
Alannah Fitzgerald & Shaoqun Wu
Image: https://www.flickr.com/photos/38446022@N00/4866805870
Overview• The FLAX project vision for building language
collections• MOOC (Massive Open Online Course) content
• Reuse across formal and informal learning• Open Educational Resources (OER) as corpora• Open Access publications as corpora
• Free gamed-based apps for Android devices for interacting with FLAX collections
Informal learning - Moocs Massive open online courses
MOOC content in FLAX
FLAX language help in Coursera
Digital library search interface
Law collections in FLAX
Openness in corpus development
The eBook of FLAX “FLAX enables teachers to build bespoke libraries very easily. It is built upon powerful digital library technology, and provides access to vast linguistic resources containing countless examples of actual, authentic, usage in contemporary text. But teachers can also build collections using their own material, focusing on language learning in a particular domain (e.g. business, law) or motivating students by using text from a particular context (e.g. country or region, common interests).”
http://flax-doc.nzdl.org/BOOK_OF_FLAX/BookofFLAX%20fullsize%20with%20links.pdf
EAP teacher-developed corpora
http://creativecommons.org/
http://www.slideshare.net/cavlec/open-sesame-and-other-open-movements
BLaRC Open Access legal corpus
http://flax.nzdl.org/greenstone3/flax?a=fp&sa=collAbout&c=BlaRC&if=
Openness in corpus building
Alannah: You know, my next question: Could you have even built the BLaRC without those open government licenses on all of those documents in the BAILII (British and Irish Legal Information Institute)?
Maria: No, that’s the thing, that’s the thing. The amazing discovery was the BAILII... and they have added a lot of overseas legal documents, including United States documents. They have the whole planet in there. It’s amazing how much stuff you can find. So, to me it was a huge, huge discovery. That was the best thing that could happen to me. That’s why I started my research on legal corpora. I mean that was one of the reasons.
Alannah: Access is so key isn’t, it? And, I’m sure that’s a big part of why the BAILII exists as well because they knew people couldn’t access LexusNexus.
Pilot study on open corpus reuse in formal learning
Construcción de Corpus para I + D: Proyecto multimodal FLAX
Maria José Marín-Maria Angeles Orts
http://tv.um.es/video?id=71961&cod=a1
Pilot study on MOOC remix in Legal Translation Studies
• 52 students in the fourth year of the Translation Degree at the University of Murcia (Spain) were selected as informants. All the students’ linguistic competence level complied with the CEFR requirements for the B2 level
• The experimental group (16 informants organised in 4 sub-groups) were requested to only consult the FLAX website as the single source of information to draft their essays. The remaining 36 students (divided into 9 different groups) would act as the control group.
• The subject syllabus did not coincide in its entirety with the course lessons included in FLAX, only 4 out of 13 topics overlapped with the ones listed on the online learning platform.
Domain-specific Collocations
We focus on lexical collocations with noun-based structures because they are the most salient and important patterns in domain-specific text.
Collocations from the English Common Law MOOC: •verb + noun e.g. abolish judicial review•noun + noun e.g. precedent case•adjective + noun e.g. common law•noun + of + noun e.g. court of appeal
Collocations in FLAX
Linking to Learning Collocations Collection (Wikipedia, BNC, BAWE)
Lexical Bundles“Lexical bundles” are multi-word sequences with distinctive syntactic patterns and discourse functions that are commonly used in academic prose (Biber & Barbieri, 2007; Biber et al, 2003, 2004).
Bundles from British Law Report Corpus (BLaRC): •noun phrase + of e.g. In the course of his•prepositional phrase + of e.g. on the part of the•it + verb/adjective phrase e.g. it is common ground that•be + noun/adjective phrase e.g. be taken into account in•verb phrase + that e.g. There is no doubt that
Pilot studyNON-FLAX-BASED TEXTS FLAX-BASED TEXTSText Number
Tokens
TextNumber
Tokens
1 4,025 1 6,1282 10,068 2 4,0663 5,348 3 4,0684 5,480 4 2,6775 9,449 Total 16,9396 8,4957 5,1338 4,0049 3,028Total 55,030
FLAX text subject areasFLAX-BASED TEXTS1. Judicial Decisions: the Meaning of Precedent in Common Law 2. Parliament and Statutes3. History and Peculiarities of the Common Law4. Introduction to the Civil and Common Courts. The European Court. Parliaments and Europe.
Non-FLAX text subject areasNON-FLAX-BASED TEXTS1. Family Law: a Comparison between the Spanish, British and American Systems2. Civil and Criminal Law in the Spanish and Common Law Systems3. International Law4. Powers of Attorney in the Spanish and Common Law Systems5. An overview on Legal Translation in English and Spanish6. Probate Law: Wills in Civil and Common Law Systems7. Contracts: a Comparative Study8. Royal Assent9. Delegated legislation in the UK, USA and Spain
Termostat analysisTERMS IDENTIFIED BY TERMOSTAT (A) (Drouin, 2003)
CORPUS SIZEAFTER REDUCTION
NUMBER OF TOPICS (B)
TERM AVERAGE (A/B)
FLAX CORPUS 226 16,939 4 56.5NON-FLAX CORPUS
385 16,264 9 42.77
Initial findings from pilot study• The experimental FLAX group appears to have acquired the
specialised terminology of the area better than the control Non-FLAX group as attested by the higher term average
• Figures show that in spite of containing a greater proportion of terms, which appear to be more specialised, the FLAX corpus has a poorer lexicon judging by its Standardised Type/token ratio (35.23), 3 points below the same datum obtained from the non-FLAX corpus.
• Finally, it has also been observed that the lexicon of the FLAX corpus tends to be more basic than the vocabulary in the non-FLAX corpus as 78.57% of the words found therein overlapped with the list of the 3,000 most frequent words of English from the British National Corpus, as opposed to 62.03% coincidence with the wordlist extracted from the non-FLAX corpus.
Future research
• Repeating the Murcia study to boost data collection and analysis
• Collecting MOOC learner survey data on openness of MOOC content for the development of corpus derivatives for language support
• Developing diagnostic domain-specific language tests for MOOC learners– Tests derived from MOOC content in law corpora
FLAX Android APPS for teacher-developed & mooc collections
FLAX TEAM Apps for Android via GooglePlay
http://commons.wikimedia.org/wiki/File:Android_robot_skateboarding.svg /
http://commons.wikimedia.org/wiki/File:Google_Play_Store.svg
Collocation GuessingDevelopers’ Interface
Collocations Guessing Learners’ Web Interface
Collocation DominoesDevelopers’ Interface
Collocation DominoesLearners’ Web Interface
Thank You
FLAX Language Project & Software Downloads: http://flax.nzdl.org/ The How-to eBook of FLAX: http://flax-doc.nzdl.org/BOOK_OF_FLAX/BookofFLAX%20fullsize%20with%
20links.pdfFLAX Game-based Apps for Android via Google Play Store (free):
https://play.google.com/store/apps/developer?id=FLAX%20TEAM&hl=en
Ian Witten (FLAX Project Lead): [email protected] Wu (FLAX Research and Development): [email protected]
Alannah Fitzgerald (FLAX Open Language Research): [email protected] José Marín (Legal English Research and Development): [email protected]
Maria Angeles Orts (Legal English Research and Development): [email protected] Jemma Konig (FLAX Apps Development): [email protected] Xiaofeng Alex Yu (FLAX Apps Development): [email protected]
OER Research Hub: http://oerresearchhub.org/ TOETOE Technology for Open English Blog: www.alannahfitzgerald.org
Slideshare: http://www.slideshare.net/AlannahOpenEd/ Twitter: @AlannahFitz