spark bi-clustering - ow2 big data initiative, altic
TRANSCRIPT
![Page 2: Spark Bi-Clustering - OW2 Big Data Initiative, altic](https://reader033.vdocuments.net/reader033/viewer/2022052901/556a6f5ed8b42ab0468b51cd/html5/thumbnails/2.jpg)
Twitter #ow2 #sl2014 @Altic_buzzwww.ow2.org
smart #OpenSource Software #BusinessIntelligence
assembler
![Page 3: Spark Bi-Clustering - OW2 Big Data Initiative, altic](https://reader033.vdocuments.net/reader033/viewer/2022052901/556a6f5ed8b42ab0468b51cd/html5/thumbnails/3.jpg)
Twitter #ow2 #sl2014 @Altic_buzzwww.ow2.org
Altic tools / approach
• ETL : Talend
• Big Data : Spark, Hortonworks Data Platform (Hadoop), Elasticsearch
• Data Warehouse : InfiniDB
• Reporting : JasperReports, Birt
• OLAP : Mondrian, Palo
• Dashboard : Tableau Software, D3
• BI platform : SpagoBI
![Page 4: Spark Bi-Clustering - OW2 Big Data Initiative, altic](https://reader033.vdocuments.net/reader033/viewer/2022052901/556a6f5ed8b42ab0468b51cd/html5/thumbnails/4.jpg)
Twitter #ow2 #sl2014 @Altic_buzzwww.ow2.org
Biclustring on Big Data
● Tugdual SARAZIN, PhD
● ALTIC
● LIPEN (Paris 13)
● Biclustring
● a Biclustring algorithm on Big Data
● Spark
● Based on SOM – Self Organized Map
● Available on Github : Spark-Clustering
![Page 5: Spark Bi-Clustering - OW2 Big Data Initiative, altic](https://reader033.vdocuments.net/reader033/viewer/2022052901/556a6f5ed8b42ab0468b51cd/html5/thumbnails/5.jpg)
Twitter #ow2 #sl2014 @Altic_buzzwww.ow2.org
Integration with SpagoBI
● Spark Bi Clustering can be an engine for SpagoBI
● Define a data set as input
● Execute the biclustering with appropriate settings
● Store result in a defined format
– Databases– Big data storage (HDFS)– SpagoBI Dataset
![Page 6: Spark Bi-Clustering - OW2 Big Data Initiative, altic](https://reader033.vdocuments.net/reader033/viewer/2022052901/556a6f5ed8b42ab0468b51cd/html5/thumbnails/6.jpg)
Twitter #ow2 #sl2014 @Altic_buzzwww.ow2.org
Integration with Talend
● Spark Biclustering can be a component for Talend Big Data
● Add new features to existing Talend Big Data components
– Biclustering● Allow to map your data
![Page 7: Spark Bi-Clustering - OW2 Big Data Initiative, altic](https://reader033.vdocuments.net/reader033/viewer/2022052901/556a6f5ed8b42ab0468b51cd/html5/thumbnails/7.jpg)
OW2 Big Data Initiative
Charly Clairmont, ALTIC
Charly CLAIRMONT
@egwada / @[email protected]
http://www.altic.org
Thanks