The Research and Development Unit for English Studies is a corpus linguistics research group working to uncover real-world linguistic patterns and trends.
Research summary
The Research and Development Unit for English Studies has built an international reputation for developing novel computational, statistical, and linguistic methods. We have released a range of free-to-use software tools to support detailed linguistic analysis.
Research background
We have undertaken a range of projects, culminating in a set of linguistic tools.
WebCorp Live was designed to test the hypothesis that the web could provide evidence of rare, new, and changing language use, complementing offline text. Previous linguistic research on the web required manual processing of web pages, but WebCorp Live automatically accesses web pages – via commercial search engines, such as Google – and produces examples of the words and phrases to be studied. It can also search in multiple languages, and can therefore play a key role in augmenting language teaching and translation.
A sister project, the WebCorp Linguist’s Search Engine, saw the team build a bespoke large-scale collection of web texts for advanced linguistic and statistical analysis. This sample of the web captured a variety of document formats, content types, and subjects. It was followed by the launch of WebCorp Learn, a tool specifically designed to support interactive English language learning, which has been integrated into courses at German secondary schools.
The team took this work a step further with the release of OurSurveySays, a management tool for open-text survey analysis and insight. It provides a web-based visualisation package that can be used by non-specialists – for example, marketing strategists, academic planners, and course directors – to analyse text-based survey responses. This allows organisations to undertake detailed analysis and make tactical interventions in response to findings.
Aiming to resolve a different problem, the eMargin project addressed the limitations of applying a traditional close-reading approach in modern teaching settings: notes on physical texts become cluttered and are not easily shared or reused, and recreating class-based close reading for distance-learning students is particularly challenging. With no other solutions readily available, the eMargin web-based annotation tool was developed. The tool not only enables collaboration and discussion across multiple locations, but also retains a digital record of students’ progress. Although it was designed as a teaching tool, it can also be used for collaborative textual annotation.
Outcomes and impact
Our online WebCorp, eMargin and OurSurveySays tools bridge significant gaps in textual analysis, enabling novel teaching practices and enhanced insights from otherwise unmanageably large datasets.
With over 5,000 monthly users in 190 countries, our software has:
- Facilitated data-driven language teaching in higher education institutions around the world
- Augmented English language teaching in German secondary schools
- Enriched the teaching of literary analysis and textual interpretation in higher education, further education, and schools, with particular growth during the COVID-19 pandemic
- Improved the accuracy and efficiency of professional translation services
- Enabled the Belgian and Alicante chapters of Podemos to formulate political policy in a collaborative way
- Informed decision making by university planners and management at five UK institutions.
The WebCorp tools have also featured in over 1,700 publications by researchers worldwide and across disciplines.