Standards for Content Creation and Globalization

Language Technology Standards by Basic Application

Language Technology Standards by Sponsoring Body

The Information Cycle Feedback Loop

A Terminology of Language Industry Standards [Under construction]

Logo Standard Abbreviation Sponsor
1.  Base Standards
 
1.1 Markup Languages
JTC1 Standard Generalized Markup Language SGML ISO/IEC JTC1 SC 18
 

An international standard (ISO 8879) published in 1986 that prescribes a standard format for embedding descriptive markup within a document, as well as specifying a standard method for describing the structure of a document.

 
http://www.iso.org/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=16387
ftp://ftp.ifi.uio.no/pub/SGML/TEI/P3SG.DOC
http://www.arbortext.com/data/getting_started_with_SGML/getting_started_with_sgml.html
W3C Hypertext Markup Language HTML W3C
  A language for the exchange of scientific and other technical documents, suitable for use by non-document specialists, which addressed the problem of SGML complexity by specifying a small set of structural and semantic tags suitable for authoring relatively simple documents and providing support for hypertext, and which became the universal markup language used to create the World Wide Web.
  http://www.w3.org/MarkUp/
W3C Extensible Hypertext Markup Language XHTML W3C
  A reformulation of HTML 4.01 as an XML application that defines a single namespace for HTML (http://www.w3.org/1999/xhtml), that relies on HTML 4.01 for the semantics and data types of elements and attributes, that defines three XHTML 1.0 DTDs corresponding to the HTML 4 Strict, Transitional, and Frameset DTDs, and that provides guidelines for authoring XHTML documents for delivery to existing Web browsers.
  http://www.w3.org/TR/xhtml1/#xhtml
http://www.w3.org/TR/xhtml-basic/
W3C Extensible Markup Language XML W3C
  A subset of SGML whose goal is to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML and that has been designed for ease of implementation and for interoperability with both SGML and HTML.
  http://www.w3.org/TR/REC-xml
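
As a minimal sketch of what processing XML on the receiving side can look like, the following Python snippet parses a small document with the standard library and checks well-formedness. The element names (`glossary`, `term`) are hypothetical and are not taken from any standard listed here.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; element and attribute names are illustrative only.
SAMPLE = """<?xml version="1.0"?>
<glossary>
  <term id="t1">markup</term>
  <term id="t2">metadata</term>
</glossary>"""


def list_terms(xml_text):
    """Return (id, text) pairs for every <term> element in the document."""
    root = ET.fromstring(xml_text)
    return [(term.get("id"), term.text) for term in root.iter("term")]


def is_well_formed(xml_text):
    """Return True if the text parses as XML, False otherwise."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False
```

A document with mismatched tags, such as `<a><b></a>`, is rejected by the parser rather than repaired, which is one of the ways XML differs from the lenient handling typical of HTML browsers.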
 
1.2 Metadata Resources
W3C Resource Description Framework RDF W3C
  A language designed to support the Semantic Web, in much the same way that HTML is the language that helped initiate the original Web; a framework for supporting resource description, or metadata (data about data), for the Web. RDF provides common structures that can be used for interoperable XML data exchange. RDF follows the W3C design principles of interoperability, evolution, and decentralization.
  http://www.w3.org/RDF/
JTC1 Metadata Registry Data Elements ISO/IEC JTC 1,
SC 32
  ISO 11179: A family of standards that specifies basic aspects of data element composition, including metadata and that applies to the formulation of data element representations and meaning as shared among people and machines; ISO 11179 solutions do not apply to the physical representation of data as bits and bytes at the machine level.
  http://www.diffuse.org/meta.html#ISO11179
http://www.iso.ch/iso/en/CombinedQueryResult.CombinedQueryResult?queryString=11179
OMG Unified Modeling Language UML OMG
  A graphic language for visualizing, constructing, and documenting the artifacts of software systems.
  http://www.uml.org/
DublinCore Dublin Core DC DCMI
  Metadata that is used to supplement existing methods for searching and indexing Web-based resources, regardless of whether the corresponding resource is an electronic document or a "real" physical object; sponsored by the Dublin Core Metadata Initiative.
  http://dublincore.org/
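
The sketch below shows one way a few Dublin Core elements might be attached to a resource description in XML. The namespace URI is that of the Dublin Core element set; the `record` wrapper element and the sample values are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core element set namespace


def make_dc_record(title, creator, language):
    """Build a small record holding three Dublin Core elements."""
    ET.register_namespace("dc", DC_NS)
    record = ET.Element("record")  # hypothetical wrapper element
    for name, value in (("title", title), ("creator", creator), ("language", language)):
        child = ET.SubElement(record, "{%s}%s" % (DC_NS, name))
        child.text = value
    return record


record = make_dc_record("Sample Style Guide", "A. Writer", "en")
xml_text = ET.tostring(record, encoding="unicode")
```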
 
1.3 Character Encoding
UNICODE UNIversal character enCODing system UNICODE UNICODE Consortium
 

An International Standard (fully synchronized with ISO/IEC 10646 in character repertoire and code points) that specifies a character coding system designed to support the worldwide interchange, processing, and display of texts written in the diverse languages of the modern world, as well as of texts written in classical and historical languages.

  http://www.unicode.org/unicode/uni2book/uc20ch1.html
http://www.w3.org/TR/unicode-xml/
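
To illustrate the difference between a character's code point and its encoded form, here is a short sketch using Python's standard `unicodedata` module. The sample string is arbitrary.

```python
import unicodedata


def describe(text):
    """Return (character, code point, Unicode name) for each character."""
    return [
        (ch, "U+%04X" % ord(ch), unicodedata.name(ch, "<unnamed>"))
        for ch in text
    ]


# "A", "e with acute", and the euro sign, written with escapes for portability.
sample = "A\u00e9\u20ac"
rows = describe(sample)
utf8_bytes = sample.encode("utf-8")  # 1 + 2 + 3 bytes in UTF-8
```

The same three code points occupy one, two, and three bytes respectively in UTF-8, which is why character counts and byte counts must not be confused when exchanging text.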
 
1.4 Access Protocols and Interoperability
CORBA Common Object Request Broker Architecture CORBA Object Management
Group (OMG)
  An open, vendor-independent architecture and infrastructure that computer applications use to work together over networks.
  http://www.corba.org/
SOAP Simple Object Access Protocol SOAP W3C
  A lightweight and simple XML-based protocol that is designed to exchange structured and typed information on the Web and whose purpose is to enable rich and automated Web services based on a shared and open Web infrastructure.
  http://www.w3.org/TR/SOAP/
http://www.develop.com/soap/soapfaq.htm#1
http://msdn.microsoft.com/library/default.asp?url=/nhp/Default.asp?contentid=28000523
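
A rough sketch of the structure of a SOAP message follows: an envelope containing a body, which in turn carries the application payload. The namespace is the SOAP 1.1 envelope namespace; the operation name `LookupTerm` and its parameter are hypothetical.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope namespace


def build_envelope(operation, params):
    """Serialize a minimal SOAP envelope whose body holds one operation element."""
    ET.register_namespace("soap", SOAP_NS)
    envelope = ET.Element("{%s}Envelope" % SOAP_NS)
    body = ET.SubElement(envelope, "{%s}Body" % SOAP_NS)
    op = ET.SubElement(body, operation)  # application-defined payload element
    for name, value in params.items():
        ET.SubElement(op, name).text = str(value)
    return ET.tostring(envelope, encoding="unicode")


# Hypothetical request: look up the term "locale".
message = build_envelope("LookupTerm", {"term": "locale"})
```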
 
2. Content Creation
 
2.1 Authoring
Proprietary Style Guides   Enterprises
  In-house style guides, usually targeted at specific text types, designed to provide uniform quality and maintain company image and either configured as standard, human-readable resources or as automated style-checkers.
  Perform a Web search to find links to many different types of style guides that you can use directly or as a model for designing your own.
Proprietary Single Source Authoring Solutions   Enterprises
Vendors
  In-house or vendor-supported SGML or XML-based systems whose purpose is to create modular text components that are maintained in a central text repository, possibly translated, used, and reused to generate a variety of texts and text types (e.g., manuals, online help, Webpages, etc.). 
  Perform a Web search to view information on various vendor options for single-source authoring solutions.
ISO Quality Management Systems ISO 9000 ISO TC 176
  ISO 9000: Quality Management Systems; A family of standards which comprises a generic international reference for quality requirements in business to business dealings.
  http://www.iso.ch/iso/en/iso9000-14000/iso9000/selection_use/iso9000family.html
IEEE Software Documentation IEEE 1063 IEEE
  IEEE 1063:2001, Standard for Software User Documentation: A component of the IEEE family of software engineering standards that presents the minimum requirements for the structure, information content, and format of user documentation, including both printed and electronic documents used in the work environment by users of systems containing software.
  http://shop.ieee.org/store/product.asp?prodno=SS94976
 
2.2 Text and Content Markup
XLIFF XML Localisation Interchange File Format XLIFF OASIS
  A format whose purpose is to store localisable data and carry it from one step of the localisation process to the other, while allowing interoperability between tools; supersedes Open Tag and differs in features and purpose from TMX.
  http://www.oasis-open.org/committees/xliff/documents/xliff-specification.htm
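
The following sketch shows how a tool in a localisation chain might read such a file and find the units that still need translation. The sample is a simplified, namespace-free fragment modeled on XLIFF 1.x; the file name and strings are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Simplified fragment modeled on XLIFF 1.x (no namespace); content is illustrative.
SAMPLE = """<xliff version="1.0">
  <file original="ui.properties" source-language="en" target-language="de" datatype="plaintext">
    <body>
      <trans-unit id="1"><source>Save</source><target>Speichern</target></trans-unit>
      <trans-unit id="2"><source>Open</source></trans-unit>
    </body>
  </file>
</xliff>"""


def untranslated_ids(xliff_text):
    """Return the ids of all trans-unit elements that have no <target> yet."""
    root = ET.fromstring(xliff_text)
    return [
        unit.get("id")
        for unit in root.iter("trans-unit")
        if unit.find("target") is None
    ]
```

Because source and target text travel together in one file, each step of the process can pick up where the previous one stopped.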
Historical and related formats & projects
OpenTag Open Tag Open Tag opentag.com
  An earlier standard XML markup format for storing, manipulating, and using extracted text for localization purposes that is no longer under development and that has been superseded by XLIFF.
  http://www.opentag.com/xliff.htm#DiffWithOTF
 
3. Translation
DIN Translation Contracts DIN 2345 DIN
Normen-ausschuß
Terminologie
  DIN 2345, Ausgabe: 1998-04, Übersetzungsaufträge (Edition 1998-04, Translation Contracts): DIN standard that specifies requirements for translation contracts, particularly regarding the source text, cooperation between all parties involved in the translation contract, selection of the translator, work procedures, the target text (translation), and revision of translations.
  http://www2.din.de/
http://www.doculine.com/news/1998/12_98/DIN2345.htm

Austrian Standards
Institute

Translation Contracts ÖNORM 1200 / 1201 ÖNORM TC ?
  ÖNORM D 1200/1201: Translation and interpretation services, Translation Services. Requirements of the service and the provision of the service; Translation and interpretation services, Translation services. Translation contracts.
   
ASTM Translation and Localization QA ASTM F15.48 ???? ASTM TC F15.48
  ASTM F15.48, Consumer-Oriented Guide to Quality Assurance in Translation and Localization : A guide designed to identify factors relevant to the quality of language translation and localization services for each phase of a project and to provide a framework within which the participants in a services agreement can define the specifications necessary to arrive at a product that satisfies defined customer needs. (Currently in ballot)
  http://accurapid.com/journal/11astm.htm
Translation Quality Metric SAE J2450 Translation Quality Metric Task Force
  SAE J2450:2001, Translation Quality Metric : SAE standard designed to develop an industry-wide metric for the evaluation of translation quality, the objective of which is to establish a consistent standard against which the translation quality of automotive service information can be objectively measured, regardless of the source or target language, or of how the translation is performed--i.e., human translation or machine translation.
  http://www.sae.org/technicalcommittees/j2450p1.htm
http://www.sae.org/servlets/productDetail?PROD_CD=J2450_200112&PROD_TYP=STD&COMMON_SUCCESS=TRUE
American Translators
Association
Framework for Error Marking   ATA Certification
Committee
  Critical Items List developed by the ATA Certification Committee consisting of 20+ error categories used in the evaluation of ATA certification exams. The framework is designed with the purpose of improving inter-rater reliability.
 

http://www.atanet.org

TMX Translation Memory eXchange TMX LISA/OSCAR
  An XML-based format designed to allow easier exchange of translation memory data between tools and/or translation vendors with little or no loss of critical data during the process.
  http://www.lisa.org/tmx/tmx.htm
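
As a minimal sketch of the exchange format, the snippet below reads translation units from a small TMX-style document and pairs each language with its segment. The header values and sentences are invented for illustration; a real exchange file would carry tool-specific metadata.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the reserved xml:lang attribute under this qualified name.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

SAMPLE = """<tmx version="1.4">
  <header creationtool="demo" creationtoolversion="0.1" datatype="plaintext"
          segtype="sentence" adminlang="en" srclang="en" o-tmf="none"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Print the report.</seg></tuv>
      <tuv xml:lang="fr"><seg>Imprimez le rapport.</seg></tuv>
    </tu>
  </body>
</tmx>"""


def read_units(tmx_text):
    """Return one {language: segment} dictionary per translation unit."""
    root = ET.fromstring(tmx_text)
    units = []
    for tu in root.iter("tu"):
        units.append(
            {tuv.get(XML_LANG): tuv.findtext("seg") for tuv in tu.findall("tuv")}
        )
    return units
```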
Segmentation TM Segmentation Segmentation LISA/OSCAR
  A recently formed LISA/OSCAR work group whose task is to examine the different ways that localization tools segment text for translation and perform word counts, and to make recommendations on how to achieve industry standards in these areas.
  http://www.lisa.org/oscar/seg/
 
4.  Terminology and Lexicography Standards
 
4.1  ISO TC 37 Terminology Standards
ISO Terminology Standards TC 37
Standards
ISO TC 37
 

ISO 639:1988 Code for the representation of names of languages
ISO 639-2:1998 Code for the representation of names of languages - Part 2: Alpha-3 code
ISO CD 639-3:2003 Code for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages
ISO NWI 639-4:2003 Code for the representation of names of languages - Part 4: Implementation guidelines and general principles for language coding
ISO NWI 639-5:2003 Code for the representation of names of languages - Part 5: Alpha-3 code for language families and groups
ISO NWI 639-6:2003 Code for the representation of names of languages - Part 6: Alpha-? code [for dialects]
ISO 704:2000 Terminology work - Principles and methods
ISO 860:1996 Terminology work - Harmonization of concepts and terms
ISO 1087-1:2000 Terminology work - Vocabulary - Part 1: Theory and application
ISO 1087-2:2000 Terminology work - Vocabulary - Part 2: Computer applications
ISO 1951:1997 Lexicographical symbols particularly for use in classified defining vocabularies
ISO 6156:1987 Magnetic tape exchange format for terminological/lexicographical records (MATER) [Withdrawn]
ISO 10241:1992 Preparation and layout of international terminology standards
ISO 12199:2000(E) Alphabetical ordering of multilingual terminological and lexicographical data represented in the Latin alphabet
ISO 12200:1999 Computer applications in terminology - Machine-readable terminology interchange format (MARTIF) - Negotiated interchange
ISO 12616:2002 Translation-oriented terminography
ISO/TR 12618:1994 Computer aids in terminology - Creation and use of terminological databases and text corpora [Withdrawn]
ISO 12620:1999 Computer applications in terminology - Data categories
ISO CD 12620-1:2003 Defining parameters for specifying data categories for terminology collections and other language resources
ISO CD 12620-2:2003 Computer applications in terminology - Data categories for terminology collections
ISO CD 12620-3:2003 Computer applications in terminology - Data categories for electronic lexical resources

Planned: Morpho-syntax, machine translation lexicons, a standard for registering the language codes as data categories to be used in the language resources metadata registry

ISO 15188:2001 Project management guidelines for terminology standardization

SC 4: Other language resources
Integration of TEI, EAGLES-ISLE, and other similar initiatives into the ISO process

  http://linux.infoterm.org/iso-e/i-iso.htm
 
4.2  Terminology and Lexicography Interchange
TBX TermBase eXchange TBX LISA/OSCAR
  An open, XML-based standard format, compliant with ISO 16642, for exchanging terminological data while retaining complex embedding structures.
  http://www.lisa.org/tbx/
ISO Terminology Markup Framework TMF ISO TC 37/
SC 3/WG 3
  ISO 16642; a metamodel that defines underlying structures and mechanisms needed for the computer representation of terminological data that is designed to express interchange information between different TMLs (Terminology Markup Languages), but is not designed to serve as a TML itself.
ISO Lexical Resource Markup Framework LMF ISO ????? TC 37/
SC4
New work item proposal for ISO that defines underlying structures and mechanisms needed for computer representation of non-presentational lexical resource data designed to facilitate interoperability with terminological, NLP, and ontological resources.
No link currently available.
ISO Data Categories Dat-Cats ISO TC 37/SC 3/WG 1
  ISO 12620; a metadata registry that lists both open and closed data elements used in terminology management and documented by TC 37/SC 3/WG 1.
The ISO 12620 family of standards encompasses general guidelines for a global Data Category Registry (DCR) (Part 1), for terminology (Part 2), and for electronic lexical resources (Part 3), and will incorporate such thematic domains as morpho-syntactic markup, machine translation, etc. The DCR will be maintained as an XML-RDF resource by a formal Registration Authority.
  http://www.iso.org/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=2517
http://www.loria.fr/projets/SALT/
OLIF Open Lexicon Interchange Format OLIF OLIF Consortium
  A format that focuses on the interchange of data among lexbase resources from various machine translation systems.
  http://www.olif.net/
Historical and related formats & projects
TEI Terminology Interchange Format TEI-Term Text
Encoding
Initiative
  An attempt to define the encoding of literary and linguistic texts in electronic form using TEI headers, which are an SGML-based metadata format for the documentation of these electronic texts; the effort included TEI work group 13, which developed the TIF/E-TIF (Terminology Interchange Format) as a component of the original TEI project.
  http://www.tei-c.org/Vault/GL/P3/TE.htm
ISO MAchine-Readable Terminology Interchange Format MARTIF ISO TC 37/
SC 3/WG 3
 

ISO 12200 (TIF, E-TIF); MAchine-Readable Terminology Interchange Format; an SGML terminology interchange format originally derived from TEI-Term and significantly modified, which later evolved into LISA/OSCAR TBX.

  http://www.iso.org/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=21174
http://www.ttt.org/clsframe/
SALT SALT (See below) XLT SALT

Crystal

Standards-based Access to multilingual NLP Lexicon and human-oriented Terminology resources, an EU/US-sponsored consortium, which developed XLT/DXLT, an eXchange format for Lex/Term-data, which later evolved into the TBX format.
  http://www.loria.fr/projets/SALT/

Geneter

Geneter (See below) Geneter Geneter
  Modèle GENErique pour la TERminologie; A terminology markup language that provides a generic model for SGML-based representation of terminological data that is designed to represent terminological data and to serve as an intermediate format between different applications and platforms.
  http://www.uhb.fr/Langues/Craie/balneo/demo_geneter.pl?langue=1
 
4.3 Controlled Language Standards
AECMA AECMA Simplified English AECMA SE AECMA & ATA
  A controlled English standard based on research of procedural texts in maintenance manuals by AECMA (European Association of Aerospace Industries) and ATA (American Air Transport Association) project groups, which is specified in the document PSC-85-16598 "A Guide for the Preparation of Aircraft Maintenance Documentation in the International Aerospace Maintenance Language". Also called: AECMA Simplified English Guide for the Preparation of Aircraft Maintenance Documentation.
  http://www.aecma.org/Publications.htm
Proprietary Controlled English CE Misc.
  A subset of a standard English that is restricted by formalised writing rules, e.g., Ogden's BASIC (British American Scientific International Commercial) English (http://www.marshallnet.com/~manor/basiceng/basiceng.html), Perkins Approved Clear English or PACE (Perkins Engines), Simplified English (Association Européenne des Constructeurs de Matériels Aéronautiques — AECMA), Français Rationalisé (Groupement des Industries Françaises Aéronautiques et Spatiales - GIFAS), Multinational Customized English or MCE (Xerox), EasyEnglish (IBM), Caterpillar Technical English or CTE, Controlled Automotive Service Language or CASL (General Motors), Attempto Controlled English (ACE) (University of Zurich), and Siemens-Dokumentationsdeutsch (SDD).
  http://www.controlledenglish.com/history.html
 
5. Knowledge Structures (Ontologies and other Semantic Mapping Frameworks)
W3C Web Ontology Language OWL W3C
The OWL Web Ontology Language is designed for use by applications that need to process the content of information instead of just presenting information to humans.

http://www.w3c.org/TR/owl-features

DAML DARPA Agent Markup Language DAML+OIL DARPA
OIL Ontology Inference Layer OIL  
  A semantic markup language for Web resources that builds on earlier W3C standards such as RDF and RDF Schema, extends these languages with richer modeling primitives, provides modeling primitives commonly found in frame-based languages, and has a clean and well defined semantics based on description logics; the proposed backbone of the Semantic Web.
  http://www.daml.org/
http://www.darpa.mil/
http://dtsn.darpa.mil/ixo/daml.asp
http://www.ontoknowledge.org/oil/oilhome.shtml
JTC1 Topic Maps Topic Maps ISO/IEC JTC 1/
SC 34
  ISO/IEC 13250--Topic Maps; A set of information resources regarded by a topic map application as a bounded object set whose hub document is a topic map document conforming to the SGML architecture defined by ISO/IEC 13250, 2nd ed.
  http://www.topicmaps.org/
http://xml.coverpages.org/topicMaps.html
http://www.y12.doe.gov/sgml/sc34/document/0129.pdf
Historical and related formats & projects
SHOE Simple HTML Ontology Extensions SHOE PLUS Group
  An HTML-based knowledge representation language that is a superset of HTML which adds the tags necessary to embed arbitrary semantic data into Web pages.
  http://www.cs.umd.edu/projects/plus/SHOE/

XOL

XML-based Ontology interchange Language XOL Molecular Biology
Ontology WG
  A language for ontology exchange designed to provide a format for exchanging ontology definitions among a set of interested parties.
  http://xml.coverpages.org/xol.html
NCITS Knowledge Interchange Format KIF ANSI / NCITS
  A language designed for use in the interchange of knowledge among disparate computer systems (created by different programmers, at different times, in different languages, and so forth).
  http://logic.stanford.edu/kif/dpans.html
 
6. Corpus-related Formats
EAGLES/ISLE Corpus Encoding Standard CES EAGLES/ISLE
ISO/TC 37/ SC 4
  EAGLES/ISLE standard that specifies a minimal encoding level that corpora must achieve to be considered standardized in terms of descriptive representation (marking of structural and typographic information) as well as general architecture (so as to be maximally suited for use in a text database) and that also provides encoding specifications for linguistic annotation, together with a data architecture for linguistic corpora.
  http://www.cs.vassar.edu/CES/
EAGLES/ISLE XCES EAGLES/ISLE
ISO/TC 37/ SC 4
XML-based CES
  An instantiation of the Corpus Encoding Standard (CES) DTDs for linguistic EAGLES corpora, developed by the Department of Computer Science, Vassar College, and Equipe Langue et Dialogue, LORIA/CNRS.
  http://www.cs.vassar.edu/XCES/
MATE Multilingual Annotation Tools Engineering MATE EAGLES/ISLE
  A project that aims to facilitate re-use of language resources by addressing the problems of creating, acquiring, and maintaining spoken dialogue language corpora through the development of a standard for annotating resources and through the provision of tools which will make the processes of knowledge acquisition and extraction more efficient.
  http://mate.nis.sdu.dk/
 
7. Language and Locale Specification
ISO Alpha-2 Code 2-Letter
Language Code
ISO TC 37/
SC 2/WG 1
  ISO 639-1; Codes for the Representation of Names of Languages -- Part 1: Alpha-2 Code; International Standard that defines 2-letter codes for languages, primarily for use in terminology, lexicography and linguistics.
  http://lcweb.loc.gov/standards/iso639-2/langcodes.html
ISO Alpha-3 Code 3-Letter
Language Code
ISO 639/JAC
TC 46/SC 4
 

ISO 639-2; Codes for the Representation of Names of Languages -- Alpha 3 Code; a part of ISO 639 that provides two sets of three-letter alphabetic codes for the representation of names of languages, one for terminology applications and the other for bibliographic applications, with the variation determined by the criteria used for formulating them.
The set represented by ISO 639-2 is now being expanded to include a variety of additional languages not included in the original collection. ISO 639-3 is slated to include the entire set of 6000+ languages recognized by Ethnologue, subject to resolution of various conflicts concerning the grouping of some languages.

  http://lcweb.loc.gov/standards/iso639-2/langcodes.html
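
The relationship between the 2-letter codes and the two 3-letter code sets can be sketched as a small lookup table. Only three languages are shown; the table layout and function name are assumptions for illustration, while the code values themselves come from ISO 639-1 and ISO 639-2.

```python
# language name -> (alpha-2, alpha-3 bibliographic, alpha-3 terminology)
LANGUAGE_CODES = {
    "English": ("en", "eng", "eng"),
    "French": ("fr", "fre", "fra"),
    "German": ("de", "ger", "deu"),
}


def alpha3(name, usage="terminology"):
    """Return the 3-letter code for a language in the requested code set."""
    _, bibliographic, terminology = LANGUAGE_CODES[name]
    if usage == "bibliographic":
        return bibliographic
    if usage == "terminology":
        return terminology
    raise ValueError("usage must be 'bibliographic' or 'terminology'")
```

For most languages the two 3-letter codes are identical, as with English; they differ only for a small number of languages such as French and German.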
ETHNOLOGUE Languages of the World Ethnologue SIL
  A computerized database for the 6,809 languages listed in Ethnologue, Volume 1: Languages of the World, Fourteenth Edition, 2000, which identifies all the language names and their alternates, and all the dialect names and their alternates.
  http://www.ethnologue.com/ethno_docs/introduction.asp
ISO Country Names and Code Elements Country Codes ISO TC 46/
WG 2
  ISO 3166: Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes
  http://www.iso.org/iso/en/prods-services/iso3166ma/02iso-3166-code-lists/index.html
ISO Code for the representation of the names of scripts Script Code ISO TC 46/ SC 2
  ISO 15924: An International Standard that provides a code for the representation of names of scripts, devised for use in terminology, lexicography, and linguistics, but usable in any application requiring the expression of scripts in coded form.
  http://www.evertype.com/standards/iso15924/index.html
W3C *Locale Code xml:lang W3C
  A reserved attribute of XML to specify the language of a given content, generally consisting of a two-letter language code plus a two-letter country code, with the stipulation that the alpha-3 codes can be used in the absence of the two-letter codes (as per IETF RFC 3066).
  http://www.w3.org/TR/1998/REC-xml-19980210.html
http://www.w3.org/XML/xml-V10-2e-errata#E11
http://www.ietf.org/rfc/rfc3066.txt
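
Because an xml:lang value applies to an element's content until a descendant overrides it, a processor has to track the inherited value while walking the tree. The sketch below does this with Python's standard library; the sample document is invented for illustration.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the reserved xml:lang attribute under this qualified name.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

SAMPLE = (
    '<doc xml:lang="en-US">'
    "<p>Colour chart</p>"
    '<p xml:lang="fr-CA">Nuancier</p>'
    "</doc>"
)


def effective_languages(elem, inherited=None):
    """Return (tag, language) pairs, applying xml:lang inheritance top-down."""
    lang = elem.get(XML_LANG, inherited)
    result = [(elem.tag, lang)]
    for child in elem:
        result.extend(effective_languages(child, lang))
    return result


languages = effective_languages(ET.fromstring(SAMPLE))
```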
IETF Language Tag Specification language
tag
IETF Network Working Group
  Specification for a language tag for use in cases where it is desired to indicate the language used in an information object, how to register values for use in this language tag, and a construct for matching such language tags.
  http://www.faqs.org/rfcs/rfc3066.html
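
The matching construct can be sketched as a prefix comparison: a language range matches a tag if the two are equal, or if the range is a prefix of the tag and the next character in the tag is a hyphen. The function name below is an assumption; the rule itself follows RFC 3066, including case-insensitive comparison and the wildcard range "*".

```python
def matches(language_range, tag):
    """Return True if the language range matches the language tag."""
    rng = language_range.lower()
    candidate = tag.lower()
    if rng == "*":
        return True  # the wildcard range matches any tag
    # Exact match, or a prefix that ends exactly at a subtag boundary.
    return candidate == rng or candidate.startswith(rng + "-")
```

Note that the comparison works on whole subtags, so the range "en" matches "en-US" but not "eng".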
POSIX standards POSIX
locales
PASC
IEEE, JTC1
A set of IEEE and ISO standards that define an interface between programs and operating systems, originally designed for UNIX but also supported on other operating systems such as Windows, and which currently allows programmers to control the following parameters: language, country or region, collating sequence, character classification and case conversion, numeric formatting, currency, date & time format, code set (in the future, UNICODE), script, and possibly other aspects.
  http://std.dkuug.dk/JTC1/SC22/WG15/
http://osr5doc.ca.caldera.com:457/cgi-bin/man/man?setlocale+S
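
Python's standard `locale` module wraps the setlocale interface, so the locale parameters can be inspected from a short script. This sketch uses only the portable "C" locale, which is available on every conforming system; other locale names vary by platform and are deliberately not assumed here.

```python
import locale


def c_locale_conventions():
    """Switch to the portable "C" locale and report some of its conventions."""
    locale.setlocale(locale.LC_ALL, "C")
    conventions = locale.localeconv()
    return {
        "decimal_point": conventions["decimal_point"],
        "thousands_sep": conventions["thousands_sep"],
        "formatted": locale.format_string("%.2f", 1234.5),
    }


result = c_locale_conventions()
```

Selecting a different locale with `setlocale` would change the reported separators and the formatted number without any change to the calling code.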
Global Microsoft National Language Support MS/NLS codes Microsoft Global
Software Development
  A set of APIs and tables for locale support, offered by all Microsoft Win32®-based systems.
  http://www.microsoft.com/globaldev/nlsweb/default.asp
http://www.microsoft.com/globaldev/gbl-gen/nls.asp#over
BIPM Coordinated Universal Time UTC BIPM
  The international time standard, formerly commonly referred to as Greenwich Mean Time (GMT), whereby zero (0) hours UTC is midnight in Greenwich, England, which lies on the prime meridian (zero degrees longitude). UTC is based on a 24-hour clock; afternoon hours such as 4 pm UTC are therefore expressed as 16:00 UTC (sixteen hours, zero minutes).
  http://www.metas.ch/en/labors/4/zeitskalen.html
http://www.boulder.nist.gov/timefreq/pubs/bulletin/nistusno.htm
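
A short sketch of converting a local time to UTC in 24-hour notation follows. The fixed offset of five hours behind UTC and the date are arbitrary values chosen for illustration.

```python
from datetime import datetime, timedelta, timezone


def to_utc_string(moment):
    """Format a timezone-aware datetime as 24-hour UTC time."""
    return moment.astimezone(timezone.utc).strftime("%H:%M UTC")


# 11:00 in a zone five hours behind UTC (illustrative fixed offset).
five_behind = timezone(timedelta(hours=-5))
local_time = datetime(2003, 6, 1, 11, 0, tzinfo=five_behind)
utc_text = to_utc_string(local_time)
```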
ISO ISO Date and Time Standard Date & Time ISO TC 154
  ISO 8601: Data elements and interchange formats -- Information interchange -- Representation of dates and times
  http://www.w3.org/TR/NOTE-datetime
http://www.iso.org/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=26780&ICS1=1&ICS2=140&ICS3=30
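
Python's `datetime` module can write and read the most common ISO 8601 representations, which makes a round trip easy to sketch. The date and time used here are arbitrary.

```python
from datetime import date, datetime, timezone

# An arbitrary moment, expressed in UTC.
stamp = datetime(2003, 11, 5, 14, 30, 0, tzinfo=timezone.utc)

date_text = date(2003, 11, 5).isoformat()  # calendar date: YYYY-MM-DD
stamp_text = stamp.isoformat()             # date, "T", time, and UTC offset
round_trip = datetime.fromisoformat(stamp_text)
```

Because the fields run from largest to smallest unit, strings in this form sort chronologically when sorted as plain text, provided they share the same offset.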
ISO BSI Currency Code Service Currency Code Currency
and funds
  ISO 4217; Codes for the representation of currencies and funds
  http://www.bsi-global.com/Technical+Information/Publications/_Publications/tig90x.doc
 


Language Technology Standards by Sponsoring Body

 

Logo Description
   
AECMA
AECMA The European Association of Aerospace Industries, AECMA, represents the aerospace industry in Europe in all matters of common interest on the level of aircraft/systems, engines, equipment and components. Its objective is to enhance the competitive development of the whole sector. Members of the Association are the national aerospace associations of all 15 EU Member States and the Czech Republic, as well as the largest European aerospace companies, which form the Council of AECMA.
  http://www.aecma.org/
ANSI
ANSI The American National Standards Institute (ANSI) is a private, non-profit organization that administers and coordinates the U.S. voluntary standardization and conformity assessment system. The Institute's mission is to enhance both the global competitiveness of U.S. business and the U.S. quality of life by promoting and facilitating voluntary consensus standards and conformity assessment systems, and safeguarding their integrity.
  http://www.ansi.org/
ASTM
ASTM ASTM International is a not-for-profit organization that provides a global forum for the development and publication of voluntary consensus standards for materials, products, systems, and services. Formerly known as the American Society for Testing and Materials, ASTM International provides standards that are accepted and used in research and development, product testing, quality systems, and commercial transactions around the globe.
  http://www.astm.org/
BIPM
BIPM The task of the BIPM is to ensure world-wide uniformity of measurements and their traceability to the International System of Units (SI). It does this with the authority of the Convention of the Metre, a diplomatic treaty between fifty-one nations, and it operates through a series of Consultative Committees, whose members are the national metrology laboratories of the Member States of the Convention, and through its own laboratory work.
  http://www.bipm.org/
DARPA
DARPA DARPA is the Defense Advanced Research Projects Agency. Its mission is to maintain the technological superiority of the U.S. military and prevent technological surprise from harming our national security by sponsoring revolutionary, high-payoff research that bridges the gap between fundamental discoveries and their military use.
  http://www.darpa.mil/
DublinCore
DublinCore The Dublin Core Metadata Initiative (DCMI) is an open forum engaged in the development of interoperable online metadata standards that support a broad range of purposes and business models. DCMI's activities include consensus-driven working groups, global workshops, conferences, standards liaison, and educational efforts to promote widespread acceptance of metadata standards and practices.
  http://www.dublincore.org/
DIN
DIN DIN, the German Institute for Standardization, is a registered association, founded in 1917. Its head office is in Berlin. Since 1975 it has been recognized by the German government as the national standards body and represents German interests at international and European level.
  http://www2.din.de/
EAGLES/ISLE
EAGLES

ISLE
The Expert Advisory Group on Language Engineering Standards (EAGLES) is an initiative of the European Commission, within DG XIII Linguistic Research and Engineering programme, which aims to accelerate the provision of standards for:
  • Very large-scale language resources,
  • Means of manipulating such knowledge,
  • Means of assessing and evaluating resources, tools and products.

    ISLE (International Standards for Language Engineering) is both the name of a project and the name of an entire set of co-ordinated activities regarding the Human Language Technology (HLT) field. ISLE acts under the aegis of the EAGLES initiative. Its aim is to develop HLT standards within an international framework, in the context of the EU-US International Research Cooperation initiative.
    http://www.ilc.cnr.it/EAGLES/home.html
    http://www.ilc.cnr.it/EAGLES96/isle/ISLE_Home_Page.htm
    IANA
    IANA The Internet Assigned Numbers Authority is dedicated to preserving the central coordinating functions of the global Internet for the public good. The IANA keeps authoritative records concerning various numbers for other organizations. The record-keeping service IANA performs is called a registration service, and IANA serves as a registry. In addition to IP addresses, IANA also serves as a registry for a variety of Protocol Numbers and several other kinds of names and identifiers.
      http://www.iana.org/
    IEEE
    IEEE The IEEE (Eye-triple-E), or Institute of Electrical and Electronics Engineers, Inc., is a non-profit, technical professional association with members in 150 countries. It promotes the engineering process of creating, developing, integrating, sharing, and applying knowledge about electro and information technologies and sciences through its technical publishing, conferences and consensus-based standards activities.
      http://www.ieee.org/
    IETF
    IETF The Internet Engineering Task Force (IETF) is a large open international community of network designers, operators, vendors, and researchers concerned with the evolution of the Internet architecture and the smooth operation of the Internet. It is open to any interested individual.
      http://www.ietf.org/
    ISO
    ISO "ISO" is the official international short form of the International Organization for Standardization. The organization is a network of the national standards institutes of 147 countries, on the basis of one member per country, with a Central Secretariat in Geneva, Switzerland, that coordinates the system. ISO standards are technical agreements which provide the framework for compatible technology worldwide. In all, there are more than 2 850 ISO technical groups in which some 30 000 experts participate annually to develop ISO standards.
      http://www.iso.org
    LISA/OSCAR
    LISA

    SEGMENTATION
    The Localization Industry Standards Association (LISA) is the premier organization for the GILT (Globalization, Internationalization, Localization, and Translation) business communities. LISA initiatives, conferences, and training programs help companies implement cost-effective and efficient international business models.

    The mission of the OSCAR Segmentation Working Group, which is part of LISA, is to define a common, open standard for documenting the segmentation rules that CAT tools use to parse formatted text into translation segments. The goal is to maximize the portability of translation memory data between CAT tools and to limit any loss of leverage.
      http://www.lisa.org/oscar/seg/
    http://www.lisa.org/
    MICROSOFT
    MICROSOFT Founded in 1975, Microsoft is the worldwide leader in software, services and Internet technologies for personal and business computing. Its corporate mission is to enable people and businesses throughout the world to realize their full potential.
      http://www.microsoft.com/
    INCITS
    INCITS

    NCITS
    The mission of the InterNational Committee for Information Technology Standards (INCITS) is to create and maintain market-driven, voluntary consensus IT standards. INCITS is accredited by, and operates under rules approved by, the American National Standards Institute (ANSI). These rules are designed to ensure that voluntary standards are developed by the consensus of directly and materially affected interests.
      http://www.incits.org/
    OASIS
    OASIS The mission of OASIS is to drive the development, convergence, and adoption of structured information standards in areas such as e-business and web services. OASIS members have formed Technical Committees (TCs) in a number of areas, including horizontal and e-business framework, web services, security, public sector, and vertical industry applications.
      http://www.oasis-open.org/
    OLIF
    OLIF Established in March 2000, the OLIF Consortium had its first meeting on April 6, 2000 at SAP headquarters in Walldorf, Germany. The purpose of the consortium, which is headed by SAP, is to generate and support a lexical/terminological interchange standard based on the OLIF prototype (Open Lexicon Interchange Format).
      http://www.olif.net/
    OMG
    OMG The Object Management Group (OMG) is an open-membership, not-for-profit consortium that produces and maintains computer industry specifications for interoperable enterprise applications. Its membership includes virtually every large company in the computer industry, and hundreds of smaller ones. Its flagship specification is the multi-platform Model Driven Architecture (MDA). CORBA is OMG's own middleware platform.
      http://www.omg.org/
    OpenTag
    OpenTag This site is dedicated to the tools and technologies used in the localization of software, on-line help and documentation. Its aim is to provide information or data that could help to streamline localization processes.
      http://www.opentag.com/
    SAE
    SAE The Society of Automotive Engineers is a resource for technical information and expertise used in designing, building, maintaining, and operating self-propelled vehicles for use on land or sea, in air or space. Its technical committees write more new aerospace and automotive engineering standards than any other standards-writing organization in the world.
      http://www.sae.org/
    SALT
    SALT Membership in the Society for Applied Learning Technology is oriented to professionals in the field of instructional technology. It is a professional society with classes of membership keyed to the interest and experience of the individual. Its conferences cover areas such as distance learning, interactive multimedia in education and training, development of interactive instruction materials, performance support systems applications in education and training, interactive instruction delivery, and information literacy.
      http://www.salt.org/
    SIL
    SIL By facilitating language-based development, SIL International (originally the Summer Institute of Linguistics) serves the peoples of the world through research, translation, and literacy. The SIL-sponsored Ethnologue database has been an active research project for more than fifty years and is widely acknowledged as the most comprehensive listing of information about the currently known languages of the world.
      http://www.sil.org/
    TEI
    TEI The TEI Consortium (TEI-C) has been established to maintain, develop, and further the use of the TEI standard. The goals of the consortium are to establish and maintain a home for the Text Encoding Initiative (TEI) in the form of a permanent organizational structure, to ensure the continued funding of TEI-C activities (for example: editorial maintenance and development of the TEI guidelines and DTD, training and outreach activities, and services to members), and to create and maintain a governance structure for the TEI-C with broad representation of TEI user-communities.
      http://www.tei-c.org/
    UNICODE
    UNICODE The Unicode Consortium is a non-profit organization founded to develop, extend and promote use of the Unicode Standard, which specifies the representation of text in modern software products and standards. The membership of the consortium represents a broad spectrum of corporations and organizations in the computer and information processing industry.
      http://www.unicode.org/
    W3C
    W3C The World Wide Web Consortium (W3C) develops interoperable technologies (specifications, guidelines, software, and tools) to lead the Web to its full potential. W3C is a forum for information, commerce, communication, and collective understanding.
      http://www.w3c.org/
     




    © Sue Ellen Wright, 2004