Support from Bibliographic Tools to Build an Organizational Taxonomy for Navigation: Use of a General Classification Scheme and Domain Thesauri ...
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
256 Knowl. Org. 37(2010)No.4 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... Support from Bibliographic Tools to Build an Organizational Taxonomy for Navigation: Use of a General Classification Scheme and Domain Thesauri Zhonghong Wang*, Abdus Sattar Chaudhry**, and Christopher Khoo *** * 625 Meadowlark Street, Troy, IL 62294 ** Department of Library and Information Science, College of Social Science, Kuwait University, Kuwait 13060 *** Wee Kim Wee School of Communication & Information, Nanyang Technological University, 31 Nanyang Link, Singapore 637718 Dr. Wang obtained her PhD degree from the Nanyang Technological University, Singapore, in 2010. Before joining NTU, she had been a librarian in university libraries in China for more than 10 years. She currently is working in a public library in the United States. Her research interests are in the area of taxonomies, ontologies, metadata, and social tagging. Dr. Chaudhry obtained his PhD from the University of Illinois at Urbana-Champaign in 1985. He was Head of Division of Information Studies, Wee Kim Wee School of Communication and Informa- tion, Nanyang Technological University, Singapore, from 2003 to 2008. Before joining NTU, Dr. Chaudhry had held teaching and professional positions in USA, Saudi Arabia, Pakistan, and Malaysia. He is currently in the College of Social Sciences, Kuwait University. His current research focuses on information organization and knowledge management. Dr. Khoo obtained his PhD at Syracuse University and his MSc in Library & Information Science at the University of Illinois, Urbana-Champaign. He is the head of the Division of Information Studies at Nanyang Technological University, Singapore, where he teaches courses in knowledge organization, information architecture, data mining and Web-based information systems. He has also worked for several years as a science reference librarian, cataloger and online information searcher at the National University of Singapore Libraries. His main research interests are in text mining (information extrac- tion and text summarization), medical decision support system, knowledge organization, and human categorization behavior. Wang, Zhonghong, Chaudhry, Abdus Sattar, and Khoo, Christopher. Support from Bibliographic Tools to Build an Organizational Taxonomy for Navigation: Use of a General Classification Scheme and Domain Thesauri. Knowledge Organization, 37(4), 256-269. 25 references. ABSTRACT: A study was conducted to investigate the capability of a general classification scheme and domain thesauri to support the construction of an organizational taxonomy to be used for naviga- tion, and to develop steps and guidelines for constructing the hierarchical structure and categories. The study was conducted in the context of a graduate department in information studies in Singapore that offers Master’s and PhD programs in information studies, information systems, and knowledge management. An organizational taxonomy, called Information Studies Taxonomy, was built for learning, teaching and research tasks of the department using the Dewey Decimal Classification and three domain thesauri (ASIS&T, LISA, and ERIC). The support and difficulties of using the general classifi- https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
Knowl. Org. 37(2010)No.4 257 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... cation scheme and domain thesauri were identified in the taxonomy development process. Steps and guidelines for construct- ing the hierarchical structure and categories were developed based on problems encountered in using the sources. 1.0 Introduction 2003) was composed of 50 categories that indicated the narrow scope of the project. In the SeSDL edu- Taxonomies are increasingly being used to organize cational taxonomy, the “subjects” facet was based on content within organizations and to support naviga- the ten main classes of DDC, and the other facets tion of web sites or digital repositories. Several writ- used the British Education Thesaurus as one source ers have advocated using a top-down approach and of categories. However, no details of the develop- classification schemes and thesauri as sources for ment process have been reported. The prototype of building organizational taxonomies (Iyer 1995; Ait- the taxonomy was accessible on the Internet. In chison et al. 2000; Conway and Sligar 2002; Cisco other words, an empirical study of building an or- and Jackson 2005). This would allow the taxonomies ganizational taxonomy by using classification to leverage on the strengths and principles underly- schemes and thesauri is still lacking. ing existing classification schemes and thesauri We conducted an empirical study of building an (McGregor 2005; Saeed and Chaudhry 2002), and organizational taxonomy using a general classifica- enable the taxonomies to be developed with less ef- tion scheme and domain thesauri keeping in view the fort than starting from scratch (Wyllie 2005). At the previous taxonomy projects. The objectives of the same time, it has been pointed out that organiza- study are: 1) to review the capability of a general tional taxonomies are different from classification classification scheme and domain thesauri in sup- schemes and thesauri in scope, components and roles porting an organizational taxonomy that is used for (Wang et al. 2006). The coverage of organizational navigation; and, 2) to develop steps and guidelines taxonomies depends more on the activities of the or- for constructing the hierarchical structure and cate- ganizations and interests of the stakeholders. The hi- gories. We hope that the report of advantages and erarchical structure used for navigation is expected problems we encountered in using the general classi- to be more flexible and simpler; and the categories of fication scheme and domain thesauri will provide a taxonomies must be intuitive to intended users. The lesson for using sources of bibliographic tools. We construction of an effective organizational taxonomy also hope that the steps and guidelines we developed that supports navigation needs to incorporate the will be helpful for other organizations to build tax- organizational context and take into consideration onomies. its navigational role while using components of clas- sification schemes and thesauri. 2.0 Research Approach Several taxonomy projects (McGregor 2005; Saeed and Chaudhry 2002; Bertolucci 2003) have The empirical study was conducted in the context of used classification schemes and thesauri to build tax- an academic organization (a graduate school) in the onomies. These projects demonstrated that biblio- information studies domain, the Division of Infor- graphic tools have the potential of providing the mation Studies, School of Communication and In- knowledge context and terms of categories (Saeed formation, Nanyang Technological University, Sin- and Chaudhry 2002). Taxonomies built based on gapore. The Division has 15 full-time faculty mem- them would share the consistency of the classifica- bers and nearly 500 students, and offers three Mas- tion schemes and controlled vocabularies (McGregor ter’s programs by coursework: MSc in Information 2005). But these projects did not incorporate the or- Studies, Information Systems, and Knowledge Man- ganizational context, the activities of the organiza- agement, and Master’s and PhD programs by re- tions, and interests of the stakeholders, in the tax- search. The students in the MSc coursework pro- onomy development process. The organizational grams focus on courses and project reports in the context was missing in the medical taxonomy devel- Critical Inquiry course that involves small group re- opment process that used MeSH (McGregor 2005). search projects. The students in the research pro- The pilot study in the computer science domain of grams focus on research projects and theses. The Di- using Dewey Decimal Classification and IEEE Web vision has four main research groups: Information Thesaurus (Saeed and Chaudhry 2002) did not de- and Knowledge Management, Knowledge Organiza- fine the application scope of the taxonomy. The tion and Discovery, Information Retrieval and Digi- Snoopy taxonomy built using DDC (Bertolucci tal Libraries, and User and Usability Studies. We se- https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
258 Knowl. Org. 37(2010)No.4 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... lected the Division as an academic organization be- descriptions on websites of LIS schools), sources cause it has explicit goals and divisions of people that from relevant professional associations (e.g., IFLA are compatible with the two essential features of Guidelines for Professional Library/Information “goal-oriented” and “coordinated human activities Educational Programs 2000), and relevant domain toward the common goal” in slightly different defi- taxonomies (Hawkins et al. 2003; Mentzas 1994; nitions of organizations (Barnard 1938; Schein 1970; Doke and Barrier 1994; Cheung et al. 2005) in in- McAuley et al. 2007), and a considerable scale in li- formation systems and knowledge management were brary and information science education. also selected. Keeping in view suggestions in the lit- Dewey Decimal Classification (DDC) and three erature (Cheuk 2002; Hunter 2005; Pack 2002; Bat- domain thesauri, ASIS&T, LISA, and ERIC thesauri, ley 2005; Raschen 2006; Dickson 2008), we made an were chosen as sources. DDC was selected because effort to incorporate the stakeholders’ interests and its structure makes it easy to navigate and it has been perspectives in the construction of the hierarchical used in previous projects related to navigation (Saeed structure and categories by employing relevant sour- and Chaudhry 2002; Vizine-Goetz 2002). The two ces. We employed those additional sources when we thesauri (ASIS&T and LISA) in the library and in- found that DDC and the domain thesauri were not formation science area, and the ERIC education the- adequate. As recommended by Wyllie (2005), Lambe saurus, were selected based on their relevance to the (2007), Dickson (2008), Singhal and Nath (2008), in subject coverage of the taxonomy. the last construction step, we delivered the taxon- An organizational taxonomy, called Information omy draft to 11 stakeholders for review. Studies Taxonomy, was built for the Division. We de- signed three phases to develop the organizational tax- 3.0 Information Studies Taxonomy onomy keeping in view guidelines suggested in the lit- erature (Roberts-Witt 2000; Conway and Sligar 2002; Figure 1 illustrates the objectives, roles, and intended Choksy 2006, Lambe 2007; Sharma et al. 2008). The users of the taxonomy. The taxonomy was expected to first phase is taxonomy needs identification. We exam- support the learning/teaching and research tasks in the ined the goals, major tasks of the Division, and the Division. It focuses on two groups of users: graduate major stakeholders across the tasks. Interviews with students and instructors. The taxonomy will support 17 stakeholders were conducted to investigate the navigation and knowledge discovery of a digital re- stakeholders’ tasks, created knowledge assets, and pository that is relevant to the students and instruc- problems encountered in locating information re- tors’ learning/teaching and research tasks, such as sources for performing tasks. The use of existing course materials, research project reports, disserta- knowledge organization systems in the Division tions, and so on. intranet was also examined. The second phase is tax- The taxonomy draft comprises 7 facets and about onomy design. We determined the taxonomy objec- 540 categories. The subject facet comprises about tives, roles, target users, organization scheme (facets), 440 categories ranging from 2 to 5 levels. A full list- subject coverage, and target content based on prob- ing of the taxonomy prototype with references and lems that the taxonomy aimed to address, the activi- sources of labels has been reported in a previous pa- ties of the Division, and the needs of stakeholders. per (Wang et al. 2008). A brief display of the 7 facets The last phase is the taxonomy construction. and the 12 main categories (top-level) of the subject We constructed the taxonomy with a focus on the facet is shown in Fig. 2. subject facet, keeping in view the research objectives. The hierarchical structure and categories of the sub- 4.0 Support of a General Classification Scheme ject facet was manually constructed via a combina- and Domain Thesauri tion of top-down and bottom-up approach and with a focus on the top-down. The construction of the 4.1 A General Classification Scheme (DDC) hierarchical structure started from the top-level, the main categories, high-level categories (level 2, 3), and We had assumed that most of the main categories low-level categories (level 4, 5). In addition to DDC (top-level of the subject facet) could be selected or and the domain thesauri, sources related to the tasks adapted from DDC. But it is not often the case that of the stakeholders such as course materials and staff the subject coverage of an organizational taxonomy publications, sources from the community of library falls in the main classes or sub-classes of a general and information science schools (LIS) (e.g., course classification scheme, and the main categories repre- https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
Knowl. Org. 37(2010)No.4 259 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... Figure 1. Objectives, roles, and intended users of the Information Studies Taxonomy Figure 2. Overview of the Information Studies Taxonomy https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
260 Knowl. Org. 37(2010)No.4 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... senting the subject coverage can be selected. The why. First, some areas did not fall in the main classes subject coverage of this taxonomy that was deter- of DDC. For example, no specific classes were re- mined based on the programs, research groups in the lated to the areas of archives management, document Division, and the tasks of the stakeholders, was not management, knowledge management, and scholarly in line with the ten main classes and sub-classes in writing. Second, DDC represented some areas in one the library science and computer science schedule of class but could not provide detailed structures, for DDC. The 12 main categories of the subject facet example, library and information science education previously listed were not selected from DDC. (020.7), information professionals (020.9), and re- We also found that not all high-level categories search methodologies (001.4). These two situations (level 2, 3) within the main categories could be se- are probably typical because organizational taxono- lected from DDC. The high-level categories within 5 mies are different from general classification sche- of the 12 main categories (less than 50%) were se- mes in the nature of the subject coverage, and tax- lected from DDC. Table 1 lists the 5 main categories onomies used for navigation would require more de- and relevant DDC classes. The 5 main categories were tailed categories for tagging resources other than fairly similar to relevant sub-classes of DDC. For ex- book collection. Another possible situation is that ample, Information Institutions is similar to 026-027 the structures provided by the classification schemes Library and Information Sciences—Specific Kinds of might not be compatible with the perspectives of the Institutions; The Information Industry to 338.1-338.4 intended users, as experienced by Bertolucci (2003). Economics—Production—Specific Kinds of Indus- For example, the structure of the 025.04 class pro- tries; and high-level categories within the two main vided by DDC was not adopted because it organized categories of Collection Management and User Ser- types of information retrieval systems by subjects vices, and Information and Knowledge Organization, and persons. Such a structure would not fit needs of were selected from the 025 Operations of Libraries, the students and instructors for learning or teaching Archives, and Information Centers. and research in the area. Our findings suggested a general classification scheme would not be sufficient Main Categories (relevant) DDC to construct the hierarchical structure. Information Institutions 026, 027 Within the 5 relevant main categories, 15 out of Collection Management and 025.2, 025.5, 025.6, 29 (52%) categories at level 2 and 44 out of 106 User Services 028 (41.5%) categories at level 3 were identified from Information and Knowledge DDC. Table 2 lists examples of the categories. These 025.3, 025.4 Organization categories were identified from classes or relative in- 004, 005, 006, Information Technologies dex terms of the DDC. For example, within the 384.1-384.6 338.47001- main category of Information Institutions, the three The Information Industry categories at level 2, Archive, Libraries, and Informa- 338.47999 Main Categories / tion Centers, were identified from the 026 and 027 Reasons hierarchies (non-relevant) classes. Also, within the main category of Collection Information and Knowledge Management and User Services, the three categories Management / Archives man- No related classes at level 2 and some at level 3 were identified from the agement 025, 028 classes, and Relative Index terms. Not compatible Information Searching and Re- In addition to the high-level categories, we ob- with the users’ per- trieval / Information storage served that general classification schemes might pro- spectives and retrieval systems (025.04, 025.06) vide support for the low-level categories, especially The Information Profession No detailed struc- those based on the “genus/species” division. Within ture (020.9) the 5 relevant main categories, lower categories at No detailed struc- level 4 and 5 (14 out of 31 hierarchies) were identi- Education and Training ture (020.07) fied from the DDC: 9 of the 14 hierarchies are based Table 1. Relevant main categories and examples of non- on the “genus/species” division. For example, the relevant main categories lower categories within the hierarchy of Classifica- tion Schemes, Controlled Vocabularies, Special Ma- High-level categories within other 7 main categories terial Cataloging, which are based on the “ge- were not from the DDC in three situations. Table 1 nus/species” division, were identified from the 025.3 lists examples of the main categories and the reasons and 025.4 classes. https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
Knowl. Org. 37(2010)No.4 261 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... Table 2. Examples of categories identified from DDC However, we found that the DDC might not fully not be from DDC because DDC used only one class support the high-level categories within the 5 relevant (025.30285) to represent this area. The above two si- main categories; while it would play a major role. In tuations could be typical because organizational tax- this case, about half of the high-level categories within onomies and general classification schemes are differ- the 5 main categories were not from DDC. These ent in the nature of the subject coverage and applica- categories were not from the DDC in three situations. tions. The third situation was that DDC did not cover The first situation was that it was difficult to make use some new concepts, such as, metadata and social tag- of the discipline-based main classes, to represent cate- ging. The high-level category of Resource Description gories within the main category of Information Tech- within the main category of Information and Knowl- nologies from the 000 schedule that focused on com- edge Organization was thereby added from the Divi- puter science. Similarly, categories within Archive and sion’s course materials to accommodate the new con- Information Centers were selected from the ASIS&T cepts. Similarly, categories such as Ontologies, Cata- and LISA thesaurus because the 020 schedule focused loging Outsourcing, Semantic Networks, and Mobile more on librarianship. The second situation was that it Communications were added from the thesauri. Also, was difficult to fully make use of classes that allows categories could be added for the purpose of adding number-building, for example, to identify categories the taxonomy features. For example, the category of within the main category of The Information Industry Collection Measurement was added within the hierar- from the 338.47001-338.47999 class. Similarly, lower chy of Collection Development to make the hierarchy categories in the hierarchy of Computer Applications comprehensive and consistent with other hierarchies. in Information and Knowledge Organization could The above findings, as previously pointed out, indi- https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
262 Knowl. Org. 37(2010)No.4 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... Table 3. Support of DDC and the domain thesauri for the Information Studies Taxonomy cated that more than a general classification scheme management were not from the thesauri because had to be employed to construct the hierarchical they treated these areas from a broad perspective, structure of organizational taxonomy. such as information science or knowledge, and these A general classification scheme may not support the concepts were used in the Division more from the division criterion that selects categories at the same organizational communication perspective. Com- level for organizational taxonomies. General facets pound terms, such as archival collection develop- such as subjects, geography, and persons used in DDC ment and audiovisual material cataloging, were nec- were not appropriate for the taxonomy with a narrow essary for the hierarchical structure to support navi- coverage. Table 3 summarizes the support of compo- gation. Other terms, such as knowledge management nents of DDC for the organizational taxonomy. professionals, knowledge management education, li- brary and information science schools, and informa- 4.2 Domain Thesauri tion system development methodologies, reflected the organization as well as the taxonomy domain. As we expected, most categories (concepts and The above findings suggest that more than the do- terms) of the subject facet were from the three do- main thesauri had to be used to collect category main thesauri. But we found that they were not suf- terms and concepts for the organizational taxonomy. ficient for reflecting the organization, interests of We observed that the hierarchy of terms in a the- the stakeholders, and features for navigation. About saurus has the potential for supporting high-level cate- 16.6%, 71 out of 427 categories, were not from the gories. For example, the five categories at level 2 thesauri. These categories can be grouped into new within the main category of The Information Society concepts, compound terms, and terms particularly were identified from the Hierarchical Index of related to the taxonomy domain and the organiza- ASIS&T thesaurus. We found that the term relation- tion. Table 4 lists examples of these categories. New ships of thesauri were helpful for identifying low-level concepts, such as media resource centers, collabora- categories. Most relevant low-level categories were tive tagging, mobile information retrieval systems, identified from the term relationships of the thesauri. and digital watermarking, reflected the interests of A small number of low-level categories were not from the stakeholders. Among the new concepts, terms in them in two situations related to the scope of the the area of information management and knowledge thesauri and division’s interior for creating narrow https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
Knowl. Org. 37(2010)No.4 263 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... terms. For example, narrow terms provided by the 5.0 Difficulties Encountered ERIC thesaurus in the area of quantitative research methodologies were not adopted because they were The major difficulties we encountered involved par- not in line with the “genus/species” division used in tial support of DDC and domain thesauri, manipula- the hierarchy. As concluded by Saeed and Chaudhry tion of multiple sources, and incorporation of stake- (2002), we found that term relationships of thesauri holders’ interests and perspectives in the construc- were also helpful for identifying the scope of terms. tion of the hierarchical structure and categories. The For example, as previously mentioned, the terms in main categories and hierarchical structures within the area of information and knowledge management some main categories had to be constructed without provided by ASIS&T and LISA thesauri were not the help of the DDC. Concepts and terms of catego- adopted because they were not appropriate for the or- ries had to be collected from sources more than the ganizational context. The scope of these terms was domain thesauri. identified by their term relationships. Table 4 lists the The selected sources (classification schemes and support of components of the domain thesauri for the thesauri) may represent the same concepts from dif- organizational taxonomy. ferent contexts and using different terms. For exam- ple, for the concept of classification schemes, the New concepts – Media resource centers DDC used it in the librarianship context; the ASIS&T – Collaborative tagging thesaurus represented it by taxonomies. And for the – Mobile information retrieval concept of automatic thesaurus generation, the systems – Digital watermarking ASIS&T thesaurus chose automatic taxonomy genera- – Mobile commerce tion; and the LISA thesaurus combined the two terms – Information development of automatic construction and construction of – Information audit thesauri. Also, vocabularies from the 020 schedule of – Information distribution DDC focused more on librarianship, and the terms of – Information sharing the ERIC thesaurus focused on general education. – Information utilization – Knowledge audit The sources might provide different structures/ – Knowledge development term relationships for the same terms/concepts. For – Knowledge capture example, for the concept of ontologies, DDC re- – Knowledge sharing flected it using the 006.33 class of knowledge-based – Knowledge utilization systems; the ASIS&T treated it as a narrow term of Compound terms – Archival collection develop- controlled vocabularies; and the LISA thesaurus re- ment lated it to semantic web, controlled vocabularies, and – Electronic collection develop- ment thesauri. Also, the term relationships in a thesaurus – Audiovisual collection deve- may not be rigorously applied. For example, the LISA lopment thesaurus put the three terms of archival description, – Audiovisual material cataloging archives law, and national archives that are not at the – Cartographic material catalo- same conceptual level as narrower terms of archives. ging – Digital resource cataloging Terms related to – Knowledge management pro- 6.0 Steps and Guidelines for Constructing the organization fessionals the Hierarchical Structure and Categories – Information science & systems education 6.1 Steps – Knowledge management edu- cation We developed the steps for addressing the difficulties – Library and Information Sci- ence Schools we encountered in using the general classification – Information system develop- scheme and domain thesauri. We used multiple sour- ment methodologies ces in addition to the DDC and domain thesauri. In – Oral presentation skills particular, sources from the organization were used Table 4. Examples of categories from sources other than to reflect the stakeholders’ interests and perspectives the three thesauri such as in collecting the concepts and terms of cate- gories, and determining the mapping of low-level ca- tegories with high-level categories. Sources related to https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
264 Knowl. Org. 37(2010)No.4 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... the professional association (IFLA) and the com- tent as a basis to select terms from the MeSH head- munity (library and information science schools) ings. We designed a term list representing the stake- were employed to reflect the taxonomy domain and holders’ interests to select terms. The selection is in for determining the main categories. We designed the three steps as follows: steps keeping in mind the need to manipulate multi- ple sources and incorporate the stakeholders’ inter- 1. Create a list of concepts and terms related to the ests and perspectives. The steps involve constructing stakeholders’ interests by selecting and consoli- facets, main categories, category concepts and terms, dating terms from sources related to the stake- hierarchies, and labels of categories. holders’ tasks. 2. Select terms from the general classification 6.1.1 Selecting Facets scheme (DDC, class captions, and Relative Index terms), domain thesauri, and domain taxonomies The facets were not selected from the DDC or the based on the relevance to terms in the lists. domain thesauri. They were determined based on the 3. Consolidate concepts and terms from different taxonomy application context. The specific steps are sources. as follows: 6.1.4 Constructing the Hierarchies 1. Select facets from the application context of the taxonomy, such as tasks and roles of the stake- Saeed and Chaudhry (2002) proposed three steps for holders, and types of target content. constructing the hierarchical structure using classifica- 2. Create labels for facets. tion schemes and domain thesauri: selecting hierar- chies from the classification schemes, selecting terms 6.1.2 Determining the Main Categories (Level 1) from domain thesauri, and mapping the selected terms of the Subject Facet into the hierarchies. Based on their proposal, we de- signed four steps to construct the hierarchies. In addi- We designed the main categories as a separate step tion to the three steps of high-level categories, low- because they are at the top-level, represent the sub- level categories, and mapping, we inserted a step of di- ject coverage of the taxonomy, and general classifica- vision criteria to create neat categories at the same tion schemes or domain thesauri are not expected to level. The specific steps are as follows: be useful in specifying the main categories. The spe- cific steps are as follows: – High-level Categories (Level 2) 1. Based on relevance to the main categories, iden- 1. Identify major areas and concepts of the stake- tify and reconsolidate the high-level categories holders’ interests by reviewing sources related to (concepts and terms) from structures/term rela- the stakeholders’ tasks. tionships of the general classification scheme 2. Select the main categories to cover concepts of in- (DDC), the Hierarchical Index of the thesaurus terest and subject areas from industry/community (ASIS&T), and relevant domain thesauri (mainly sources (documents from professional associations Information Science Taxonomy). such as IFLA Guidelines, course descriptions from 2. Review whether the selected high-level catego- library and information science school websites), ries cover the lower category terms. Create new and domain taxonomies (e.g. Information Science high-level categories based on the lower cate- Taxonomy). gory terms when the selected high-level catego- 3. Create additional main categories to cover concepts ries cannot cover them. of interest and subject areas not found in commu- nity/industry sources and domain taxonomies. – Division Criteria (To Create Categories at the 4. Select labels from the sources and construct labels Same Level) for main categories not found in the sources. 3. Determine the division criteria by identifying knowledge structures inherent in sources in the 6.1.3 Collecting Category Concepts and Terms organization (e.g. course lectures). In the medical taxonomy project, McGregor (2005) used a term list representing the online journal con- https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
Knowl. Org. 37(2010)No.4 265 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... – Low-level Categories (Level 3) tive index terms) from the general classification 4. Identify and reconsolidate the low-level catego- scheme (DDC) into simpler expressions and in a ries from the multiple sources using the high- style reflecting the taxonomy domain. level categories as the starting points. 4. Modify the terms from the thesauri into expres- 5. Determine the low-level categories based on sions fully reflecting concepts at the target hierar- the chosen division criteria. chical levels. 5. Create labels for higher-level categories when la- – Mapping low-level categories and expansion of bels cannot be found in the category terms. hierarchies (Level 4 or 5) 6. Format the labels according to the thesaurus con- 6. Map the low-level categories to the main cate- struction standard (ANSI/NISO Z39.19-2005 gories and high-level categories by identifying standard). knowledge structures inherent in sources in the 7. Organize the labels of categories at the same level organization (e.g. course lectures). alphabetically (e.g. “genus/species”) or logically 7. Build cross references for categories that can be (e.g. “aspects” and “procedure”) based on the divi- mapped into more than one perspective or can- sion criteria used, group categories based on the not be mapped into the ideal “hierarchies”. same division criterion together when more than 8. Expand the hierarchies by further identifying one division is employed at the same level. low-level categories and mapping them to the 8. Build UF (used for) references used for category main categories and higher categories. terms that were not chosen as labels. 9. Balance the levels of hierarchies within 5 levels. Table 5 lists examples of labels modified from DDC 6.1.5 Determining Labels of Categories or the domain thesauri, and the reasons why. We created guidelines for selecting labels from vari- 6.2 Guidelines ous terms that were collected from multiple sources. To support user navigation, labels should fully reflect We developed guidelines based on the difficulties we the concepts at hierarchical levels, be simple expres- encountered in using the general classification sche- sions, and be consistent. We reviewed whether the me and domain thesauri, and experience we had in terms were appropriate for the organization, stake- constructing the hierarchical structure and catego- holders, and at the target hierarchical levels. Similar ries. The specific guidelines are as follows: guidelines can be found in the literature, but we had to address aspects of labels and terms from different 6.2.1 Selecting Facets sources. The specific steps are as follows: 1. Labels of facets are not likely to be found in a 1. Select labels from category terms according to the general classification scheme or domain thesauri guidelines we created. because they are usually very broad concepts or a 2. Determine the scope of the labels (terms) based combination of concepts. They usually have to be on their uses in organization sources. home-created. 3. Modify the vocabularies (class captions and rela- Table 5. Examples of labels modified from DDC and the thesauri https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
266 Knowl. Org. 37(2010)No.4 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... 6.2.3 Constructing the Main Categories (level 1) 3. The high-level categories (concepts and of the subject facet terms) can be selected from classes at differ- ent levels as well as related relative terms of 1. The main categories are not likely to be found in a the general classification scheme. general classification scheme or domain thesauri. 4. The division criteria inherent in the selected However, it depends on whether the subject cov- high-level categories should be reviewed to erage of the taxonomy matches the main classes ensure they are at the same conceptual level. of the general classification scheme or its sub- 5. The categories at level 2 should be based on classes. one division criteria. 2. When the subject coverage of the taxonomy does 6. When the high-level categories cannot be se- not match the main classes of the general classifi- lected from the general classification scheme cation scheme or its sub-classes, the main catego- as well as domain thesauri and relevant do- ries should be selected from sources in the or- main taxonomies, they have to be selected ganization, industry/community sources, and from sources in the organization or home- relevant domain taxonomies. created. 3. The size of the coverage and the width of the sub- 7. The selected high-level categories should be ject facet need to be considered to determine the annotated with their sources. number of main categories. – Hierarchies: division criteria (used to create 6.2.4 Selecting Category Concepts and Terms categories at the same level) 8. The division criteria are not likely to be found 1. Most of the category concepts and terms are in the general classification scheme. However, likely to be found in domain thesauri. However, it it depends on the size of the subject coverage depends on the availability of thesauri and the of the taxonomy. For a taxonomy involving scope of the thesauri. Some category concepts and several subjects and with a narrow coverage, terms are likely to be found in the general classifi- the division criteria have to be home-created. cation scheme (class captions) and relevant do- main taxonomies. – Hierarchies: low-level categories (level 3) 2. New concepts, compound concepts, and concepts 9. Most low-level categories (concepts and representing organization tasks and activities terms) are likely to be found in the domain (rather than academic subjects) may not be found thesauri. in the domain thesauri. 10. More than one division criterion can be used 3. The concepts and terms should be annotated with to determine the low-level categories, depend- their sources. ing on the number of categories and levels of 4. See references provided by the domain thesauri hierarchies. These divisions are expected to be should be kept with terms. at the same conceptual level. 6.2.5 Constructing Hierarchies – Hierarchies: mapping low-level categories and expansion of hierarchies (level 4 or 5) – Hierarchies: high-level categories (level 2) 11. The selection of categories at level 4 or 5 de- 1. If the main categories match the main class or pends on the stakeholders’ interests. Areas of sub-classes of the general classification high-level categories that are not the stake- scheme, about half the high-level categories holders’ major interests are not expected to (concepts and terms) are likely to be found in have many lower categories and levels. the general classification scheme. 12. The number of levels can be shortened by us- 2. When the main categories are not related or ing more than one division criterion at the relevant to the main classes or sub-classes of same level. the general classification scheme, the high- level categories are likely to be found in the – Determining Labels of Categories Hierarchical Index of a domain thesaurus and 13. Prefer labels from sources in the organization relevant domain taxonomies. 14. Choose the same terms to represent the same concepts. https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
Knowl. Org. 37(2010)No.4 267 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... 7.0 Conclusion DDC and domain thesauri. We did not cover ontolo- gies. For example, the GEM ontology seems to be a We conducted an empirical study of the extent that good starting point for collecting terms and term rela- the DDC and three relevant domain thesauri can be tionships in the field of education. The findings about used in developing an organizational taxonomy for an using DDC and the domain thesauri, as well as the academic department in the information studies do- steps and guidelines for constructing the hierarchical main. We started with the assumption that a general structure and categories, need to be validated in other classification scheme and several relevant domain types of organizations, knowledge domains, and using thesauri would provide excellent support for the sub- other knowledge organization systems. ject facet of an organizational taxonomy, particularly The prototype of taxonomy was implemented in in an academic organization. Some modifications to the University’s e-learning platform using a taxon- the scope of the concepts, terms (labels), or con- omy software, TLE-Equella version 3, to support cept/term relationships selected from these sources navigation. The evaluation of the effectiveness of the would of course be necessary to adjust them to the taxonomy for supporting end-users’ navigation has organizational context. But we found that the DDC been conducted to identify the problems of the tax- and the domain thesauri were far from being sufficient onomy construction steps. The evaluation methods for the organizational taxonomy. In particular, DDC we used and issues found in the evaluation results could not provide support for the top-level categories will be reported in a separate paper. of the taxonomy because the taxonomy is not disci- pline-based. Its subject coverage depends on the ac- References tivities of the organization, and the tasks of the stake- holders. The DDC could also not provide complete Aitchison, Jean et al. 2000. Thesaurus construction support for the high-level categories (level 2 or 3 of and use: A practical manual. (4th ed.). London: the subject facet) for the same reasons and because of Aslib IMI. the focus on supporting navigation. The two selected Barnard, Chester. I. 1938. The functions of the execu- domain thesauri in the area of library and information tive. Cambridge: Harvard University Press. science also could not provide support for concepts Batley, Susan. 2005. Classification in theory and prac- and terms in the area of information and knowledge tice. Oxford: Chandos Publishing. management because the organization treats them Bertolucci, Katherine. 2003. Happiness is taxonomy: from a different perspective – the perspective of or- Four structures for Snoopy. Information outlook ganizational communication. 7n3: 36-44. Organizational taxonomies are different from gen- Cheuk, W-Y B. 2002. Real-world taxonomy develop- eral classification schemes and domain thesauri in their ment: Creating a taxonomy that makes sense to application scope and navigation roles. We used addi- your employees. InsideKnowledge 5n6. Available tional sources, for example, course materials and re- http://communication.sbs.ohio-state.edu/sense- search publications of the organization to reflect the making/art/artabscheuk02kmtaxon.html (accessed stakeholders’ interests, and a domain taxonomy (In- July 20, 2007). formation Science Taxonomy) and those from the pro- Choksy, Carol E. B. 2006. 8 steps to develop a taxon- fessional association and the websites of sibling organi- omy. Information management journal Nov/Dec. zations to help with the top-level categories. The steps Available http://findarticles.com/p/articles/mi_qa we used to construct the hierarchical structure and 3937/is_200611/ai_n16871474 (accessed December categories took into consideration the necessity of ma- 4, 2006). nipulating multiple sources, the requirements for navi- Cisco, Susan. L. and Jackson, Wanda. K. 2005. Creat- gation, as well as requirements for a good taxonomy. ing order out of chaos with taxonomies {Elec- The guidelines we developed were based on the issues tronic version}. Information management journal encountered in the developing the taxonomy. May/June: 45-50. Available from EBSCO Aca- The findings of the study and the solutions imple- demic Search Premier database (accessed Decem- mented are limited to some extent to the context of ber 4, 2005). our study. For example, due to the domain the study, Conway, Susan and Sligar, Char. 2002. Chapter 6. we made use of partial schedules of DDC. Also, we Building taxonomies. In Susan Conway and Char used other knowledge organization systems such as Sligar. Unlocking knowledge assets. Redmond, domain taxonomies to complement the resources of Wash.: Microsoft Press, pp. 105-24. https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
268 Knowl. Org. 37(2010)No.4 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... Dickson, Ian. 2008. Taxonomy: Some guidelines for ef- Wang, Zhonghong et al. 2008. Using classification fective design of taxonomies. Available http://drupal. schemes and thesauri to build an organizational org/node/81589 (accessed January 31, 2009). taxonomy for organizing content and aiding navi- Hunter, Anthony. 2000. Taxonomies. Available http:// gation. Journal of documentation 64: 842-76. www.cs.ucl.ac.uk/staff/a.hunter/tradepress/tax. Wyllie, Jan. 2005. Taxonomies: Frameworks for corpo- html. (accessed July 10, 2005) rate knowledge (2nd ed.). London: Ark Group, in Iyer, Hemalata. 1995. Classificatory structures: Con- association with Inside Knowledge. cepts, relations and representation. Verlag, Frankfurt/ Main: INDEKS. Appendix: Lambe, Patrick. 2007. Organizing knowledge: Tax- Sources of the Information Studies Taxonomy onomies, knowledge and organizational effective- ness. Oxford: Chandos Publishing. Cheung, C. F. et al. 2005. A multi-facet taxonomy McAuley, John et al. 2007. Organization theory: chal- system with applications in unstructured knowl- lenges and perspectives. Harlow, England; New edge management {Electronic version}. Journal of York: Prentice Hall/Financial Times. knowledge management 9n6: 76-91. Available from McGregor, Bruce. 2005. Constructing a concise me- Emerald Full text database (accessed August 14, dical taxonomy. Journal of the Medical Library As- 2006). sociation 93: 121-23. Doke, E Reed. and Barrier, Tonya. 1994. An assess- Pack, Thomas. 2002. Taxonomy’s role in content man- ment of information systems taxonomies: Time to agement. Available http://www.econtentmag.com/ be re-evaluate? Journal of information technology Articles/ArticleReader.aspx?ArticleID=867 (ac- 9: 149-57. cessed July 10, 2005). Education Resources Information Center. U.S. De- Raschen, Bill. 2006. A resilient, evolving resource: partment of Education. ERIC Education Thesaurus. How to create a taxonomy {Electronic version}. Retrieved December, 2006 from the Education Re- Business information review 22: 199-204. Available source Information Center database. Access pro- from Sage Publications database (accessed August vided by Nanyang Technological University. 15, 2006). Hawkins, D. T. et al. 2003. Information Science Ab- Roberts-Witt, Sarah. L. 2000. Practical taxonomies. stract: Tracking the literature of information sci- Available http://www.destinationKM.com/articles/ ence. Part2: A new taxonomy for information sci- default.asp?articleid=684 (accessed July 15, 2005). ence {Electronic version}. Journal of the American Saeed, Hamid and Chaudhry, Abdus. S. 2002. Using Society for Information Science and Technology 54: Dewey Decimal Classification Scheme (DDC) for 771-81. Available from ProQuest ABI/INFORM building taxonomies for knowledge organization. Archive Complete database (accessed July 10, Journal of documentation 58: 575-84. 2005). Schein, Edgar H. 1970. Organizational psychology IFLA. Education and Training Section. 2000. Guide- (2nd ed.). Englewood Cliffs, NJ: Prentice-Hall. lines for Professional Library/Information Educa- Sharma, Ravi. S. et al. 2008. Developing corporate tional Programs – 2000. Available http://www. taxonomies for knowledge auditability: A frame- ifla.org/VII/s23/bulletin/guidelines.htm (accessed work for good practices. Knowledge organization November 3, 2006). 35: 30-46. Mentzas, G. 1994. A functional taxonomy of com- Singhal, M. and Nath, S. S. 2008. Chapter 27. An ap- puter-based information systems {Electronic ver- proach toward taxonomies. In I V Malhan and sion}. International journal of information man- Shivarama Rao K ed. Perspectives on knowledge agement 14: 397-410. Available from ScienceDirect management. Lanham, Maryland: Scarecrow, pp. database (accessed July 12, 2005). 391-406. Proquest – CSA Social Science. LISA library and in- Vizine-Goetz, Diane. 2002. Classification schemes formation science thesaurus. Retrieved December, for Internet resources revisited. Journal of Internet 2006 from the Library and Information Science cataloging 5n4: 5-18. Abstracts database. Access provided by Library of Wang, Zhonghong et al. 2006. Potential and pros- Nanyang Technological University. pects of taxonomies for content organization. Redmond-Neal, Alice and Hlava, Marjorie. M. K. Knowledge organization 33: 160-69. 2005. Thesaurus of information science, technology, https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
Knowl. Org. 37(2010)No.4 269 Z. Wang, A. S. Chaudhry, and Ch. Khoo. Support from Bibliographic Tools to Build an Organizational Taxonomy... and librarianship, 3rd ed. Medford; New Jersey: In- formation Today. OCLC. 1978. Web Dewey. Available http://connexion. oclc.org/ (accessed December, 2006). Access pro- vided by WKW School of Communication & In- formation, Nanyang Technological University. https://doi.org/10.5771/0943-7444-2010-4-256 Generiert durch IP '46.4.80.155', am 16.07.2021, 19:23:11. Das Erstellen und Weitergeben von Kopien dieses PDFs ist nicht zulässig.
You can also read