examining accesses by country, language and area of knowledge etd 2011 – cape town
TRANSCRIPT
![Page 1: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/1.jpg)
Examining Accesses by Country, Language and
Area of Knowledge
ETD 2011 – Cape Town
![Page 2: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/2.jpg)
Ana Pavani
Laboratório de Automação de Museus, Bibliotecas Digitais e Arquivos
Departamento de Engenharia ElétricaPontifícia Universidade Católica do Rio de JaneiroBrazil
[email protected] http://www.maxwell.lambda.ele.puc-rio.br/
ETD 2011 – South Africa
![Page 3: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/3.jpg)
This work is a continuation of a work presented last year in Austin. The two works differ in the following aspects:
In 2010, there were 71 data sets and this work considers 85 (20% more)
East Timor was included because accesses from this country have started happening
The UNDP has changed the way HDI is computed, so this data has been updated, as well as the populations of the countries
My co-author left the university, so this time I am by myself
![Page 4: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/4.jpg)
ETDs, PUC-Rio, BDTD & NDLTD
![Page 5: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/5.jpg)
PUC-RioRio de Janeiro
Brazil
![Page 6: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/6.jpg)
PUC-Rio is a small private university. It is divided in 3 centers and each has graduate programs:
CTCH (Humanities) – 6
CCS (Social Sciences) – 10
CTC (Science & Technology) – 10
The oldest graduate program (EE) started in 1963. The newest graduate program is less than 5 years old.
![Page 7: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/7.jpg)
Characteristics of PUC-Rio’s ETD program:
First published ETD – May 2000
ETDs became mandatory – Aug 2002
Number of ETDs – 5,694 (Jun 2011)
CTCH – 1,442 CCS – 1,291 CTC – 2,961
Yearly average number of defended T&Ds(*) – 590
(*) 2007, 2008, 2009 & 2010; (**) 2006, 2007, 2008 & 2009. There is retrospective digitization.
![Page 8: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/8.jpg)
ETDs are made available in chapters (graduate school regulation – please, don’t ask me the reason!, but it will change as of Oct 2011)
![Page 9: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/9.jpg)
All CTCH CCS CTC
Number of ETDs 5,694 1,442 1,291 2,961
Average number of ETDs – June 2004 to June 2011 3,553.1 888.9 726.1 1,938.3
Average number of partitions – June 2011 7.3 7.9 7.3 7.0
Average of averages number of partitions – June 2004 to June 2011
6.93 7.82 6.83 6.57
![Page 10: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/10.jpg)
PUC-Rio’s ETDs, BDTD(*) and NDLTD (**) :
Number of BDTD institutions – 97 (OAI-PMH data providers)
Number of BDTD metadata records – 170K+ (BDTD is an OAI-PMH data and service provider)
BDTD records are/were harvested by OCLC and other institutions, and made available worldwide
Brazilian ETDs are the largest collection in Portuguese available worldwide
(*) BDTD – Biblioteca Digital de Teses e Dissertações = Brazilian Nat’l Consortium. (**) You must know what NDLTD stands for!!!
![Page 11: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/11.jpg)
Accesses to PUC-Rio’s ETDs:
Access logs saved since – Jun 2004
Number of monthly logs when article was written – 85
![Page 12: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/12.jpg)
pt & es IN THE WORLD
![Page 13: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/13.jpg)
Worldwide WesternLanguages
Internet
Portuguese 7th 3rd 6th
Spanish 2nd 1St 3rd
pt is the official or one of the official languages of:
Angola Brazil Cape Verde Equatorial Guinea (*) East Timor (**) Guinea-Bissau Macau (***) Mozambique Portugal Sao Tome and Principe
es is the official or one of the official languages of:
Argentina Bolivia Chile Colombia Costa Rica Cuba Dominican Rep Ecuador El Salvador Equatorial Guinea (*) Guatemala Honduras Mexico Nicaragua Panama Paraguay Peru Puerto Rico Spain Uruguay Venezuela
(*) es & pt official(**) less than 5% of the population know it; it was banned during the Indonesian rule(***) UNDP did not publish in the last report; other data were used
![Page 14: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/14.jpg)
Assumptions for the analysis:
ETDs are very specialized items – people who seek ETDs are highly educated
es and pt are quite similar languages – educated people who can speak one can read the other
es and pt-speakers are potential readers of PUC- Rio’s ETDs
2 countries were not considered:
Brazil – is the home country US – there are very large groups of es and pt-speaking
persons but neither one is the language of the country
![Page 15: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/15.jpg)
2 groups were defined:
“international group” – all countries except Brazil and the US
“pt+es group” – all countries that have pt and/or es as one of the official languages
Factors considered to influence accesses to ETDs:
Population size Level of education Access to the Internet
![Page 16: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/16.jpg)
DEALING WITH COUNTRIES DIFFERENCES
![Page 17: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/17.jpg)
Mexico has 110M
inhabitants
Sao Tome and Principe has
165K inhabitants
Portugal and Spain are in
EuropeArgentina and Honduras are
in Latin America
Angola and Mozambique are in Africa
Portugal has 10M
inhabitants
Spain has 45M inhabitants
Equatorial Guinea has the
2 languages
![Page 18: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/18.jpg)
Quantization of potential accesses from countries that are very different :
Need to find data on the factors that may influence accesses to ETDs:
Population size – easy Level of education – difficult (literacy rates are easy!) Access to the Internet – difficult All data should be considered in the same time-frame
Knowledge that the second and the third factors are dependent on how developed countries are
Knowlede that it was necessary to combine the 3 factors
![Page 19: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/19.jpg)
Decision on how to deal the countries differences:
Use UNDP’s HDI – Human Development Index that contains information on the second and the third factors (HDI combines indicators of life expectancy, education and income; the new way it is computed contains means years of schooling and expected years of schooling, going beyond literacy rates)
Decision to combine HDI with the population size
Index I = Population x HDI
![Page 20: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/20.jpg)
All CTCH
Total population 420,281,000 57,858,800
Average HDI 0.707 0.527
Index I 309,420,871 25,114,111
![Page 21: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/21.jpg)
Comments:
21 es-speaking and 10 pt-speaking countries (Equatorial Guinea was counted in both)
Average HDI for es-speaking countries is 34.16% higher than the other group
Population of the es-speaking countries is almost 7.4 times the population of the other group
Index I for the es-speaking group is 12.36 times the same index for the pt-speaking group
The expectation was to have many more accesses from es-speaking countries than from pt-speaking countries!!
![Page 22: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/22.jpg)
WORKING WITH DATA AND RESULTS
![Page 23: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/23.jpg)
Information:
Number of sets of data – 85 (one for each month)
For each set, 16 variables were computed (examples – number of countries, number of pt-speaking countries countries, total number of accesses, etc)
All data were computed for the complete set and for each of the 3 areas of knowledge
![Page 24: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/24.jpg)
From the sets (collection and areas ) side
![Page 25: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/25.jpg)
This analysis focused on the way the whole collection and each individual set – CTCH, CCS and CTC – were accessed from countries in different groups.
![Page 26: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/26.jpg)
Results:
Total number of countries that accessed ETDs – 204
CTCH – 183 CCS – 183 CTC – 189
Total number in the “international group” – 202
CTCH – 181 CCS – 181 CTC – 187
![Page 27: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/27.jpg)
Maximum number of countries in the “international group” in a month – 143
CTCH – 112 CCS – 108 CTC – 132
Maximum number of countries in the “pt+es group” in a month – 28 (maximum possible 30)
CTCH – 27 CCS – 27 CTC – 27
![Page 28: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/28.jpg)
Number of months with accesses from 100 or more countries – 42
CTCH – 18 CCS – 15 CTC – 32
Some percentages follow
![Page 29: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/29.jpg)
% accesses All CTCH CCS CTC
from the international group 8.48 7.99 7.89 9.12
in the international group from the es+pt-sepaking group 69.03 73.27 68.56 66.32
in the es+pt-speaking group from pt-speaking countries 82.07 87.11 84.44 77.35
in the international group from pt-speaking countries 56.65 63.83 57.89 51.30
in the international group from Portugal 49.74 57.39 49.54 44.69
in the es+pt-speaking group from Portugal 72.05 78.27 72.26 67.39
in the pt-speaking group from Portugal 87.89 88.92 85.57 87.12
![Page 30: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/30.jpg)
Comments:
Absolute values for CTC are higher – this area has the largest collection (higher than the sum of the others)
Percentages for CTC are lower, except for accesses from the “international group”
Is it more international? It seems that language is not very important in C&T
![Page 31: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/31.jpg)
From the accesses side
![Page 32: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/32.jpg)
This analysis focused on the way accesses behaved for the complete collection and how they spread among the sets – CTCH, CCS and CTC.
The collection and the sets have different profiles – numbers of ETDs and numbers of partitions. For this reason, normalization was necessary.
![Page 33: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/33.jpg)
Quantization of potential accesses to sets of works with different profiles:
Sets are very different in:
Numbers of ETDs Numbers of partitions per ETD
This means that numbers of accesses had to be normalized in order to compare accesses to the sets
This work presents a first attempt to quantize the way the “average ETD” in a set “attracts” accesses
![Page 34: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/34.jpg)
Decision on how to deal the sets differences:
Combine average numbers of ETDs with average of average numbers of partitions
Index EI = 1 / (average number of ETDs x average of average numbers of partitions)
![Page 35: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/35.jpg)
All CTCH CCS CTC
Index EI 0.000041 0.000144 0.000202 0.000079
![Page 36: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/36.jpg)
Average numbers of All CTCH CCS CTC
Accesses 740.40 901.73 755.93 644.32
Accesses from the int group 62.77 72.01 59.63 58.79
Accesses from the es+pt-speaking group 43.33 52.76 40.88 38.99
Accesses from the pt-speaking countries 35.56 45.96 34.52 30.16
Accesses from Portugal 31.22 41.32 29.54 26.27
numbers computed for the total numbers of acesses x index EI numbers to be viewed as accumulated monthly averages can be obtained dividing by 85
![Page 37: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/37.jpg)
Comments:
When normalized data is consideredn, the average number of accesses (per ETD in Science & Technology) from the international group is the lowest among all
The same happens with accesses from the es+pt and pt-speaking groups, and Portugal as well
The reason is that ETDs in this group have the lowest average of accesses per ETD among the 3 subsets
![Page 38: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/38.jpg)
When normalized data is considered, the average number of accesses (per ETD in Humanities) from the international group is the highest among all
The same happens with accesses from the es+pt and pt-speaking groups, and Portugal as well
The reason is that ETDs in this group have the highest average of accesses per ETD among the 3 subsets
![Page 39: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/39.jpg)
FINAL COMMENTS
![Page 40: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/40.jpg)
Percentage wise, international accesses are the most significant for ETDs in S&T
At the same time, the “average S&T ETD” “attracts” less international accesses than ETDs in other areas of knowledge and the “average Humanities ETD” “attracts” the most
In all areas of knowledge, accesses from:
es- and/or pt-speaking countries are the most significant pt-speaking countries are the most significant in the
“es+pt–group” Portugal are the most significant in the pt-group
![Page 41: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/41.jpg)
New ways of defining “attraction” should be examined
Results seem to indicate that language and HDI are important factors in accesses
![Page 42: Examining Accesses by Country, Language and Area of Knowledge ETD 2011 – Cape Town](https://reader038.vdocuments.net/reader038/viewer/2022103006/56649f0d5503460f94c211d2/html5/thumbnails/42.jpg)
Thank you!Muito obrigada!