https://github.com/writecrow/crow_frontend
The user interface for the Corpus & Repository of Writing, built in Angular
https://github.com/writecrow/crow_frontend
angular corpora corpus corpus-builder corpus-linguistics natural-language-processing
Last synced: 2 months ago
JSON representation
The user interface for the Corpus & Repository of Writing, built in Angular
- Host: GitHub
- URL: https://github.com/writecrow/crow_frontend
- Owner: writecrow
- License: mit
- Created: 2018-10-08T01:36:08.000Z (over 6 years ago)
- Default Branch: main
- Last Pushed: 2025-02-13T15:22:59.000Z (4 months ago)
- Last Synced: 2025-02-13T16:33:11.299Z (4 months ago)
- Topics: angular, corpora, corpus, corpus-builder, corpus-linguistics, natural-language-processing
- Language: TypeScript
- Homepage: https://crow.corporaproject.org
- Size: 1.73 MB
- Stars: 1
- Watchers: 6
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Corpus & Repository (Frontend)
This is the source code for the front-end of a web-based corpus. What does that mean? A corpus is a collection of texts which can be searched & analyzed. A front-end is the part of a web application that is responsible for the user interface, for sending requests for data & queries to the "back-end," and then displaying that data.This front-end is designed for a specific dataset, the Corpus and Repository of Writing (Crow), which is an inter-institutional collection of student writing. However, it can also provide a starting point for designing a corpus with a different dataset. And the Crow team is available for support & consulting.
For the live version of the site, visit https://crow.corporaproject.org.
# Table of contents
* [Quickstart](#quickstart)
* [The architecture of this web-based corpus](#architecture)
* [Building your own corpus](#building-your-own-corpus)
* [How to make changes to the code](#usage)
* [How to construct requests via the API](#the-api)
* [Updating Angular](#updating-angular)
* [Linkages between corpus & repository](#linkages)# Quickstart
(This assumes [NodeJS](https://nodejs.org/en/) is installed.)
1. Adjust the backend host as needed in `src/environments/environment.ts`
2. Get a local host running:
```
[email protected]:writecrow/crow_frontend.git
npm install
npm install -g @angular/cli
ng serve
```Navigate to `http://localhost:4200/`. The app will automatically reload if you change any of the source files.
# Architecture
Corpora are big-data applications. They must be able to perform complex searches on large datasets. Thanks to the precedents of Google and Amazon, people expect web-based search portals to be fast. Very fast. In order to meet this expectation, a number of contemporary web design patterns must be followed:- **Decoupled display/data** (see https://pantheon.io/decoupled-cms): instead of delivering all of a web page every time a user performs an action, only update the part of the web page that needs to change.
- **Multiple layers of caching**: the result of computational operations that will yield the same result (until the dataset is updated) can be stored and the memory-intensive operation can be skipped.
- **Search index pre-processing**: each searchable thing (i.e. word) in each resource (i.e., text) in the dataset (i.e, the corpus) must be indexed beforehand so that millions of words don't have to be individually crawled at runtime.In order to achieve the above, this site was built:
- Using [AngularJS](https://angularjs.org/), a JavaScript framework whose business logic lives in the browser (and therefore greatly reduces the amount of data that must be sent across the internet on each user action)
- Using time-based caching on the front-end, and change-sensitive caching on the back-end
- Using pre-loading of data from the back-end to predict the data a user may request before it is requested# Building your own corpus
Developing a modern web application requires a not insignificant amount of knowledge not only about a given programming language, but also about programming principles, how data is transferred across the internet, and about the specific framework on which an application is built (in this case, [AngularJS](https://angularjs.org/)).The following section will explain how to get up and running for developing with this application, but be forewarned: it may be daunting for non-developers; at the bare minimum, it assumes a familiarity with the command line, `git`, package management, integrated development environments, and local hosting.
These instructions also assume a UNIX-like environment (e.g., Linux, Mac), so if you're using Windows, you'll need to make some accommodations.
1. If you haven't done so, install `node.js` & `npm` : https://docs.npmjs.com/getting-started/installing-node
1. Download this repository as you would normally: `git clone https://github.com/writecrow/corpus_frontend.git`
1. From the directory of the download, run `npm install` to build local components.
1. Run `ng serve` for a dev server.
1. Navigate to `http://localhost:4200/`. The app will automatically reload if you change any of the source files.## Compiling the code for production
1. If not already present, copy the src/environments/environment.dev.ts into `src/environments/environment.prod.ts` and copy values from the "Crow build values" secure note.
2. Run `ng build --configuration=production` to build the project. The build artifacts will be stored in the `dist/` directory.
3. Deploy with the `deploy` helper bash tool, which will rsync the `dist` directory to the production server.# Usage
This application is built on the [Angular](https://angularjs.org/) framework, and follows the best practices of said framework. For those familiar with AngularJS, making modifications to the code should be self-obvious.However, for complete beginners, here are a few orientation tips for making general changes, assuming you have successfully got the site running locally at `http://localhost:4200/`; see above):
- All functional code is located in the `src/app` directory
- The file that defines the "pages" (routes) is `src/app/app.module.ts`
- The back-end base URL is defined in `src/environments/environment.ts`. If you have a different back-end, change it here.Each "component" of the application consists of
- the `.ts` file, which controls the business logic of route, and the dynamic variables available to the template
- the `.html` file, which defines the markup that will display on the page, and the placeholders for the dynamic data which can be passed from the `.ts` file
- the `.css` file, which adds display elements to the component# The API
All dynamic data is retrieved from the backend API, whose base URL is defined in `src/app/api/restAPI.service.ts`.Requests for specific data can be performed by the parameters explained in the backend repository, at https://github.com/writecrow/corpus_backend#performing-search-requests-via-the-api
For example, the following URL:
https://writecrow.corporaproject.org/texts/word?search=friend+woman&op=or&_format=json
will return all texts via a standard word search that include the word "friend" or "woman", returned in JSON format.
The same result could be returned in XML format with the URL:
https://writecrow.corporaproject.org/texts/word?search=friend%20woman&op=or&_format=xml
A result set that requires "friend" AND "woman" would be:
https://writecrow.corporaproject.org/texts/word?search=friend%20woman&op=and&_format=json
For a systematic explanation of requesting data from the back-end, see https://github.com/writecrow/corpus_backend#performing-search-requests-via-the-api
## Updating Angular
See [https://update.angular.io/](https://update.angular.io/)Replace `@14` with the target version, along with the applicable typescript version specified by the command's output:
```
ng update @angular/core@14 @angular/cli@14 @angular-eslint/builder@14 @angular-eslint/schematics@14 @angular/platform-browser-dynamic@14 @angular/router@14 @angular/platform-browser@14 @angular/material@14 @angular/forms@14 @angular/compiler@14 @angular/cdk@14 @angular/animations@14 @angular-devkit/build-angular@14 @angular/compiler-cli@14 @angular/language-service@14 [email protected]
```# Linkages
The corpus & repository show related materials based on common metadata. The current logic of these linkages is summarized below:### Repository interface
Examples of all 5 linkages can be seen at https://crow.corporaproject.org/repository/14701. "Instructor materials related to this assignment"
- Filter logic: institution + course + semester + assignment + year + instructor2. "Other materials from this instructor"
- Filter logic: institution + course + different assignment or is syllabus + semester + year + instructor3. "Materials from other instructors"
- Filter logic:institution + course + semester + assignment or IS syllabus + year + different instructor4. "Student texts associated with this material"
- Filter logic: institution + course + assignment + semester + year + instructor5. "Student texts from other instructors"
- Filter logic: institution + course + assignment + different instructor### Corpus interface
Examples of all 4 linkages can be seen at https://crow.corporaproject.org/corpus/107_LN_3_SAU_1_M_10607_UA_c1. "Similar texts from this instructor's course"
- Filter logic: institution + course + instructor + assignment2. "Similar texts from other instructors' courses"
- Filter logic: institution + course + assignment + different instructor3. "Instructor materials related to this assignment"
- Filter logic: institution + course + instructor + year + semester + assignment OR is a syllabus4. "Materials by other instructors"
- Filter logic: institution + course + assignment OR is a syllabus + different instructor