|
1
|
- Sue Ellen Wright
- Kent State University
- Institute of Applied Linguistics
- ©Sue Ellen Wright 2006
|
|
2
|
- Model for visualizing information
- An amorphous flow of undifferentiated “stuff”
- An aggregate of individual elements (little packages, components) that
can be identified, delimited, organized (modeled), stored, retrieved,
manipulated, and reused
- Stuff people can figure out if they think about it
- Stuff a computer program can automatically recognize and process
|
|
3
|
- A shopping basket is a container to put stuff in.
- The stuff in the basket can be tossed in loose without packaging.
- Stuff gets lost.
- Stuff gets mixed up.
|
|
4
|
|
|
5
|
- Keeps products clean and uncontaminated
- Makes them easy to identify
- Makes them easy to store
- Makes them easy to reuse
|
|
6
|
|
|
7
|
|
|
8
|
- Termbases form nested structures
- Large structures
- Analogy:
- Boxes inside of boxes
- Matryoshka dolls
|
|
9
|
|
|
10
|
- Top level: termbase = virtual files
- Actual file components:
- Master file
- History file
- Index
- Lok file
|
|
11
|
- Termbase files contain Terminological Entries (Term Entries)
- One concept
- All languages
- All terms
- All descriptive info
- All administrative info
|
|
12
|
- Other entry types
- Bibliographical entries
- Responsibility entries
- Other shared resources
- Thesaurus information
- Classification systems
- Info on external resources
|
|
13
|
- Term entries are hierarchical
- Term Entry
- Language group
- All the terms associated
with a language
- Term info group (tig)
- All the info associated
with a given term
|
|
14
|
- Big blue box: the termbase master file
- Smaller aqua box:
the term entry:
- Holds data fields
- Data categories
- Data elements
|
|
15
|
- Smaller mauve box: the language set
- All term info groups for a given language
- Any info that pertains just to a given language
- Small green box: the term information group
- A single term
- All related information
|
|
16
|
- Data Modeling Variance
- Granularity
- Choice of level for a data element concept (field name/attribute)
- Data Element Autonomy
- Combinability and repeatability
- Elemental nature of data elements
- Shared Resources
|
|
17
|
- The degree of detail that can be achieved by using the available data
fields (data categories) to document terminological information
- Ex: Grammar vs. (low
granularity)
- Part of speech (high granularity)
- Gender
- Number
|
|
18
|
- Low level of granularity:
- Grammar: noun, masculine, singular
- High level of granularity:
- Part of speech: noun
- Gender: masculine
- Number: singular
- Advantage of granularity: retrievability
- Disadvantage: more work
|
|
19
|
|
|
20
|
|
|
21
|
|
|
22
|
- Term autonomy: each term has its own field
- Which is combinable with a full set of descriptive data categories
- Which are in turn repeatable throughout the term information group
|
|
23
|
- Only one kind of thing can occupy a data element
- e.g., no terms or synonyms listed as such in definition fields
- Only one of a thing can occupy a data element
- e.g., only one term in a term field
|
|
24
|
|
|
25
|
|
|
26
|
|
|
27
|
|
|
28
|
|
|
29
|
|
|
30
|
- Graphics
- Charts
- Audio
- Video
- Drawings
- Disk Archives
- Responsibility Records
|
|
31
|
|
|
32
|
|
|
33
|
|
|
34
|
|
|
35
|
- Terms
- Term autonomy
- All terms are created equal
- One term per term element
- Complete documentation of each term possible
|
|
36
|
- Main entry term
- Synonym
- Abbreviation
- Full form
- Variant
- Phrase
- Collocation
- Boilerplate
|
|
37
|
- Term
- Part of speech
- Grammatical gender
- Grammatical number (use when necessary)
- Term type (Type) (see next slide)
- Status
- Regional label (not in our model)
- Pronunciation (not in our model)
- Register (usage register)
|
|
38
|
|
|
39
|
|
|
40
|
|
|
41
|
- Definition
- Source ID
- (Points to shared resources)
- Definition type
- Administrative info
|
|
42
|
- Context
- Source ID
- (Points to shared resources)
- Context type
- Administrative info
|
|
43
|
- Concept relations
- Notes
- Other administrative information
- Bibliographical information
- Special categories, e.g.:
- For standardization
- For inventory control
|
|
44
|
- Index
- Terms
- Term-like elements
- (collocations, boilerplate)
- Text
- Attributes
|
|
45
|
|