repositori akses terbuka di indonesia (pdf)

51
Repositori Akses Terbuka di Indonesia Ismail Fahmi, PhD. Inisiator Indonesia OneSearch (IOS) [email protected] Lokakarya Nasional PDII LIPI 10 Agustus 2016

Upload: ismail-fahmi

Post on 09-Jan-2017

286 views

Category:

Technology


2 download

TRANSCRIPT

Repositori Akses Terbuka di IndonesiaIsmail Fahmi, PhD.InisiatorIndonesia OneSearch (IOS)[email protected]

Lokakarya Nasional PDII LIPI10 Agustus 2016

2

1992 – 2007 S1, Teknik Elektro, ITB2003 – 2004 S2, Computational Linguistics, Universitas Groningen, Belanda2004 – 2009 S3, Computational Linguistics, Universitas Groningen, Belanda

2000 – 2003 Inisiator IndonesiaDLN (Digital Library Network pertama di Indonesia)Mengembangkan Ganesha Digital Library (GDL)Mendirikan Knowledge Management Research Group (KMRG) ITBMembangun Digital Library ITB

2009 – Sekarang Engineer di Weborama, Perusahaan berbasis big data (Paris/Amsterdam)2012 – Sekarang Co-Founder Awesometrics, Media Monitoring & Analytics Company2014 – Sekarang Founder PT. Media Kernels Indonesia, a Natural Language Processing Company2015 – Sekarang Konsultan Perpustakaan Nasional, Inisiator Indonesia OneSearch

Ismail Fahmi, [email protected]

Agenda

•Manfaat Open Access•Pengumpulan Data•Metadata•Otomasi Pengolahan•Temu Kembali•Semantic Web•Copyright

3

4

Jack Andraka15 tahun, USA

5

Open Access Around The World

6

Goal

7

“To have a robust national open access repository

discovery system”

INDONESIADiscovery System

OAI Discovery System

8

Merupakan sebuah

Indonesia OneSearch (IOS)

9

Sumber Data: Perpustakaan

10

Total:66.000+

* Sumber: Perpusnas 2016

Jenis Data

11

Katalog Buku E-Journal Digital Repository Museum

Discontinued

Pengumpulan Data

12

OAI-PMHa low-barrier interoperability framework

Standard Interoperability Protocol

~75% Repository di seluruh dunia ‘OAI-Compliant’

Skenario Harvesting

13

Any Platforms – Any Collections

14

Any Platforms Any Collections

Standard OAI-PMH

15

Harvester Provider(Repository)

Request Verbs:• Identify• ListMetadataformats• ListSets• ListIdentifiers• ListRecords• GetRecord

https://www.openarchives.org/OAI/openarchivesprotocol.html

Semua Harus Mengikuti Standard

16

http://an.oa.org/OAI-script? verb=ListRecords&

from=1998-01-15&set=physics:hep&metadataPrefix=oai_rfc1807

https://jurnal.uns.ac.id/index.php?journal=alchemy&page=oai&verb=ListRecords&

metadataPrefix=oai_dc badArguments:journal,page

https://www.openarchives.org/OAI/openarchivesprotocol.html

Metadata

17

Paling banyak dimplementasikan(berdasarkan data Indonesia OneSearch):• MARC à marcxml• DC à oai_dc

http://an.oa.org/OAI-script? verb=ListRecords&metadataPrefix=oai_dc

http://an.oa.org/OAI-script? verb=ListRecords&metadataPrefix=marcxml

Metadata Harus Standard

18

Semua platform software yang berbasis open sourceseperti OJS, Eprints, Dspace, dan Koha, sudah medukung dan comply dengan standard OAI-PMH dan metadata.

Software lokal cukup banyak yang tidak mendukung OAI-PMH, atau yang tidak comply.

SLIMS paling banyak digunakan di Indonesia, dan versi lama belum comply. Versi OAI-PMH untuk SLIMs dapat didownload di: http://wiki.onesearch.id/doku.php?id=oai-slims

Otomasi Pengolahan

19

Valid?

Repository Admin Harvester AdminOAI Harvester & Index

PeriodicHarvesting

No Yes

Temu Kembali

20

• Auto-Suggest• Relevancy Search• Faceting• Deduplication

Auto-Suggest

21

Relevancy & Facet

22

Sort by Relevancy, etc.

Facet

Duplicate Records

23

Deduplication

24

Sebelum Sesudah

Semantic Web

25

Contoh: Semantic Search

26

Fact Extraction

Fact Extraction

27

Fact extractedfrom document

Fact Extraction dalam Medical

28

Fact extractedfrom document

29

Knowledge Graph

30

Resource Description Framework (RDF)

31

IOS: Fact Extraction

32

Text Analysis

33

Text Analysis is:• the process of

analyzing unstructured text,

• extracting relevant information

• and then transformingthat information into structured information

• that can be leveragedin different ways

Contoh: Tesis “Hak Ulayat”

34

Contoh: Tesis dari UNDIP

35

Fullteks tesis: 112 halamanBahasa: Indonesia

36

S

P

O

Fact Graph

37

Fact Graph

38

Co-occurrence Analysis

39

Open vs Closed Access

40

Open vs Closed Access

41

Contoh: Tesis dari UAJ

42

Open Access: Enabling Innovation

43

Tesis dari UNDIP Tesis dari UAJ

Text Analysis: Manfaat

44

Semoga bisa menjadi dasarpemahaman:• interdisiplinaritas, • cross-disciplinarity, • transdisciplinarity, dan

multidisciplinarity

- Edda Priyanto – Dosen Ilmu Perpustakaan UGM

Open Access Copyright

45

1. Authors sign a publishing agreement where they will have copyright but grant broad publishing and distribution rights to the publisher.

2. The author chooses an end user license under which readers can use and share the article.

3. The publisher makes the article available online with the author's choice of end user license.

Text Analysis: Open Access Licensing

46

PDF, 55 pages, English

Knowledge Graph: Open Access

47

People

48

Road Map IOS

49

Tahap 1 (2015): OneSearch Portal

OneSearch Portal (Bibliografi)

• Software Indonesia OneSearch

• Harvesting data bibliografi• Protokol standard OAI-PMH• Repository: Katalog buku,

Jurnal Online, Repositoridigital

Tahap 2 (2016-2017): Text Analysis

Text Analysis (Full Teks)

• Crawling fullteks (PDF) TA, tesis, disertasi, laporan penelitin, danartikel jurnal dari Intitusi di Indonesia.

• Text analysis menggunakanteknologi NLP (Natural Language Processing)

• Information Extraction & Knowledge Mapping berbasisNLP

• Research Mapping antar Institusi

Tahap 3 (2018-2020): Layanan Anti PlagiarismNo Plagiarism (Services)

• Sistem dan LayananNoPlagiarism untuk karyaberbahasa Indonesia.

• Sumber: Wikipedia (Bahasa Indonesia), Online News, TA, Tesis, Disertasi, artikel jurnal, laporan penelitian (open access)

• Layanan online plagiarism checking untuk mahasiswa danpeneliti di Indonesia

Kesimpulan

50

• Open Access akan membantu mempercepat terjadinya INOVASI.

• Open Access Indonesia – Discovery System (OAI-DS), dibutuhkan untuk mengelola seluruh repositori Open Access dan seluruh jenis data (tidak terbatas pada e-jurnal).

• Indonesia OneSearch (IOS) adalah sebuah OAI-DS.• Text Analysis dan Fact Extraction dalam IOS merupakan

langkah awal membangun Knowledge Graph dari seluruh repositori Open Access di Indonesia.

Terimakasih

51

Ismail Fahmi, PhDEmail: [email protected]: 0812 8908 3894