Project Achievements

Here you can read more about FashionBrain dissemination activities, publications and reports, and the prototypes and technologies developed in the project.

Quick Summary


Events Attended

Academic Events

Event | Type | Date | Project representative who attended
34th IEEE International Conference on Data Engineering | Conference | April 2018 | Ying Zhang, Martin Kersten
11th Extremely Large Databases Conference | Conference | May 2018 | Ying Zhang, Sjoerd Mullender
ACM SIGMOD/PODS International Conference on Management of Data | Conference | June 2018 | Martin Kersten
The 7th International Workshop on Testing Database Systems (DBTest) | Workshop | June 1, 2018 | Martin Kersten
The 9th biennial Conference on Innovative Data Systems Research (CIDR) | Conference | January 2019 | Ying Zhang, Martin Kersten
35th IEEE International Conference on Data Engineering (ICDE 2019) | Conference | April 2019 | Svetlin Stalinov
The 2019 ACM SIGMOD/PODS Conference | Conference | June 2019 | Ying Zhang, Martin Kersten
45th International Conference on Very Large Data Bases | Conference | August 2019 | Ying Zhang, Martin Kersten
The sixth AAAI Conference on Human Computation and Crowdsourcing | Conference | July 2018 | Alessandro Checco, Gianluca Demartini
The 41st International ACM SIGIR Conference on Research and Development in Information Retrieval | Conference | July 2018 | Gianluca Demartini
The fifth AAAI Conference on Human Computation and Crowdsourcing | Conference | October 2017 | Alessandro Checco
iConference 2018 | Conference | March 2018 | Alessandro Checco
2017 Workshop on Hybrid Human-Machine Computing (HHMC 2017), Guildford, UK | Workshop | September 2017 | Alessandro Checco
'Machine learning meets fashion' workshop at KDD 2017 | Workshop | August 2017 | Alessandro Checco
The 3rd Strategic Workshop on Information Retrieval in Lorne (SWIRL) | Workshop | February 2018 | Gianluca Demartini
The 28th edition of the Australasian Database Conference, ADC 2017 | Conference | September 2017 | Gianluca Demartini
Australasian Document Computing Symposium | Conference | December 2017 | Gianluca Demartini
Digital Transformation & Global Society (DTGS 2018) | Conference | June 2017 | Gianluca Demartini
2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017) | Conference | September 2017 | Alan Akbik, Duncan Blythe
ReWork Machine Learning Summit 2017 | Seminar | October 2017 | Roland Vollgraf
Thirty-first Conference on Neural Information Processing Systems (NIPS 2017) | Conference | December 2017 | Roland Vollgraf
11th Edition of the Language Resources and Evaluation Conference (LREC 2018) | Conference | May 2018 | Alan Akbik
The 27th International Conference on Computational Linguistics (COLING 2018) | Conference | August 2018 | Alan Akbik
LIBER Annual Conference (LIBER 2018) | Conference | July 2018 | Alan Akbik
International Conference on Computer Vision (ICCV 2017) | Conference | October 2017 | Marko Jocic, Matthias Dantone
CrowdBias 2018 | Workshop | July 2018 | Alessandro Checco, Gianluca Demartini
IEEE BigComp2019 | Conference | March 2019 | Alexander Löser (Tutorial Chair)
Second Workshop on Software Foundations for Data Interoperability | Workshop | February 2019 | Alexander Löser (Chair)
35th IEEE International Conference on Data Engineering (ICDE 2019) | Conference | April 2019 | Svetlin Stalinov (demo)
Dagstuhl Seminar on Multi-Document Information Consolidation | Seminar | April 2019 | Sebastian Arnold (speaker)
ACM SIGMOD/PODS International Conference on Management of Data (SIGMOD/PODS 2019) | Conference | July 2019 | Ying Zhang (Industrial track chair), Martin Kersten (General chair)
Data Power 2019 | Conference | September 2019 | Alessandro Checco (speaker)
ACL 2019 | Conference | July 2019 | Sebastian Arnold (speaker)
ACM CIKM 2019 | Conference | November 2019 | Benjamin Winter (speaker), Alexander Löser (speaker)
HCOMP 2019 | Conference | October 2019 | Alessandro Checco (speaker)
Northern Lights Deep Learning Workshop, NLDL 2019 | Workshop | February 2019 | Alan Akbik
Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL 2019 | Conference | June 2019 | Alan Akbik
First symposium on Biases in Human Computation and Crowdsourcing | Workshop | October 2019 | Alessandro Checco (speaker)

Industry Events

Name of Industrial Event | Venue | Date
Big Data, Amsterdam v 6.0 | Funda, Amsterdam, NL | January 2017
Deep Learning for Text Mining Tasks, inovex GmbH | Hamburg, DE | February 2017
MOOC Artificial Intelligence, acatech, CEBIT | Hannover, DE | March 2017
ACM Distinguished Speaker talk | Accenture Latvia | April 2017
Amsterdam Artificial Intelligence & Deep Learning (H2O & Booking.com) | Booking.com, Amsterdam, NL | April 2017
Artificial Intelligence Day, Springer Nature | Berlin, DE | May 2017
Panel debate Let's talk about Data Products, inovex GmbH | Cologne, DE | May 2017
Panel debate Data Products What's Next, inovex GmbH | Hamburg, DE | May 2017
"Deep Learning & AI" by Scyfer #1 | Impact Hub Amsterdam, NL | May 2017
CWI in Bedrijf | CWI Amsterdam, NL | May 2017
amst-R-dam Simple Imputation and Date Padding | CWI Amsterdam, NL | May 2017
Data Products and Exchange | Hasso Plattner Institute, Potsdam, DE | June 2017
Text and data mining (TDM) workshop in European Parliament | Brussels, BE | June 2017
PyData Amsterdam: Data Science week edition @ Flow Traders | Flow Traders, Amsterdam, NL | June 2017
Smart Cities 2.0 congres | Figi Zeist, NL | June 2017
ADS Drinks & Pizza Summer Startup | UvA, Amsterdam, NL | June 2017
ADS Coffee & Data: Visual Analytics | UvA, Amsterdam, NL | July 2017
"So how does Tensorflow work?", guest star Siraj Raval | Google Netherlands, Amsterdam | August 2017
New challenges in Reinforcement Learning: Dr. O. Vinyals (Google DeepMind) | Amsterdam Science Park, NL | September 2017
Shoptalk Europe | Copenhagen, DK | October 2017
European Big Data Value Forum 2017 | Paris, FR | November 2017
Data Natives 2017 | Berlin, DE | November 2017
20e editie Data Donderdag - ING, NS, Growth Tribe, Valuemaat | GoDataDriven, Amsterdam, NL | November 2017
CWI Lectures on Machine Learning | CWI Amsterdam, NL | November 2017
ADS Festive Drinks & Data: 2017 Highlights & Looking Forward to 2018 | Amsterdam Business School, NL | December 2017
Influx/Days | London, UK | January 2018
"Handelsblatt goes future: Artificial Intelligence" Conference | Munich, DE | February 2018
ProductTank Berlin, Data Products | Microsoft, Berlin, DE | March 2018
SAP Conference on Machine Learning | Berlin, DE | March 2018
Shoptalk | Las Vegas, USA | March 2018
Federal Ministry of Economics: German Finnish Information Exchange | Finnish Embassy Berlin, DE | April 2018
Brussels TechSummit 2018 | Brussels, BE | June 2018
K5 | Berlin, DE | June 2018
AI Expo Europe 2018 | Amsterdam RAI, NL | June 2018
ADS Drinks & Data Summer Startup | Amsterdam Business School, NL | June 2018
FashionTech | Berlin, DE | July 2018
Big Data Expo | Utrecht, NL | September 2018
HiPEAC CSW Autumn 2018 | Heraklion, GR | October 2018
EBDVF 2018 | Vienna, AT | November 2018
ACE startup meeting | Amsterdam, NL | November 2018
Meeting organised by Dutch consulate to meet the local Big Data bureau and AI companies | Chongqing, CN | November 2018
HiPEAC, European Network on High Performance and Embedded Architecture and Compilation | Valencia, ES | January 2019
H2020 Successful R&I in Europe 2019 - 10th European Networking Event | Düsseldorf, DE | February 2019
Data Warehousing & Business Intelligence summit | Utrecht, NL | March 2019
FOX AI Summit | Cologne, DE | May 2019
Swedish German Business Days (Swedish Embassy) | Berlin, DE | November 2019
Austrian German Business Days (BMVIT and BMWi) | Berlin, DE | November 2019
Tagesspiegel 5th Digital Future Science 2019 | Berlin, DE | March 2019

Other Events

Event | Venue | Date/s | Project representative who attended | Type | Description
Data Science at ASOS.com | ASOS.com HQ (London) | August 2018 | Paul Clough | Presentation | Presentation of FashionBrain and The University of Sheffield research in Data Science
Startup Qualifiction.com | EXIST (BMWi) | July 2017 | Alexander Löser | Startup foundation | Text Mining for spotting bestsellers
Startup Beezdata.de | BerlinStartupGrant | January 2018 | Alexander Löser | Startup foundation | Matching NGOs and Trusts
FashionBrain with projectstarling.com | Online | September 2018 | Alessandro Checco | Presentation | Presentation of FashionBrain and collaboration plans
Crowdsourcing papers presentation | University of Queensland | October 2018 | Alessandro Checco | Presentation | Presentation of FashionBrain research in Crowdsourcing
IDEL Paper (D4.3) | IEEE BigComp2019 | February 2019 | Alexander Löser | Presentation | Best Paper Award (145 submissions, 42 accepted)
AthNLP 2019 | NCSR Demokritos | September 2019 | Benjamin Winter, Tom Oberhauser | Summer School | Exchange of information and experience among European NLP researchers

Publications

Academic Publications

Let's Agree to Disagree: Fixing Agreement Measures for Crowdsourcing | Alessandro Checco, Kevin Roitero, Eddy Maddalena, Stefano Mizzaro and Gianluca Demartini | HCOMP 2017 | Conference | October 2017
Understanding Engagement through Searching Behaviour | Mengdie Zhuang, Gianluca Demartini and Elaine Toms | CIKM 2017 | Conference | November 2017
Considering Assessor Agreement in IR Evaluation | Eddy Maddalena, Kevin Roitero, Gianluca Demartini and Stefano Mizzaro | ICTIR 2017 | Conference | October 2017
FashionBrain Project: A Vision for Understanding Europe's Fashion Data Universe | Alessandro Checco, Gianluca Demartini, Alexander Löser, Ines Arous, Matthias Dantone, Richard Koopmanschap, Svetlin Stalinov, Martin Kersten, Ying Zhang | KDD Fashion 2017 | Workshop | https://www.overleaf.com/docs/rxrzrqhrwtkj/pdf | August 2017
The Projector: An Interactive Annotation Projection Visualization Tool | Alan Akbik and Roland Vollgraf | EMNLP 2017 | Conference | http://www.aclweb.org/anthology/D17-2008
ZAP: An Open-Source Multilingual Annotation Projection Framework | Alan Akbik and Roland Vollgraf | LREC 2018 | Conference | http://www.lrec-conf.org/proceedings/lrec2018/pdf/301.pdf | May 2018
FEIDEGGER: A Multi-modal Corpus of Fashion Images and Descriptions in German | Leonidas Lefakis, Alan Akbik and Roland Vollgraf | LREC 2018 | Conference | http://www.lrec-conf.org/proceedings/lrec2018/pdf/319.pdf | May 2018
Love at First Sight: MonetDB/TensorFlow | Richard Koopmanschap, Ying Zhang and Martin Kersten | ICDE 2018 | Other
Love at First Sight: MonetDB/TensorFlow | Richard Koopmanschap, Ying Zhang and Martin Kersten | XLDB 2018 | Other
In-Database Machine Learning with MonetDB/TensorFlow | Richard Koopmanschap, Ying Zhang and Martin Kersten | XLDB 2018 | Other
On Fine-Grained Relevance Scales | Kevin Roitero, Eddy Maddalena, Gianluca Demartini and Stefano Mizzaro | SIGIR 2018 | Other | July 2018
Investigating User Perception of Gender Bias in Image Search: The Role of Sexism | Jahna Otterbacher, Alessandro Checco, Gianluca Demartini and Paul Clough | SIGIR 2018 | Conference | July 2018
On the Volatility of Commercial Search Engines and its Impact on Information Retrieval Research | Jimmy, Guido Zuccon and Gianluca Demartini | SIGIR 2018 | Other | July 2018
The Evolution of Power and Standard Wikidata Editors: Comparing Editing Behavior over Time to Predict Lifespan and Volume of Edits | Cristina Sarasua, Alessandro Checco, Gianluca Demartini, Djellel Difallah, Michael Feldman and Lydia Pintscher | Journal of CSCW | Journal | June 2018
An Introduction to Hybrid Human-Machine Information Systems | Gianluca Demartini, Djellel Eddine Difallah, Ujwal Gadiraju and Michele Catasta | Foundations and Trends in Web Science | Other | December 2017
All That Glitters is Gold - An Attack Scheme on Gold Questions in Crowdsourcing | Alessandro Checco, Jo Bates and Gianluca Demartini | HCOMP 2018 | Conference | https://aaai.org/ocs/index.php/HCOMP/HCOMP18/paper/view/17925/16904 | July 2018
Investigating Stability and Reliability of Crowdsourcing Output | Rehab K. Qarout, Alessandro Checco and Kalina Bontcheva | CrowdBias 2018 | Workshop | July 2018
RelVis: Benchmarking OpenIE Systems | Rudolf Schneider, Tom Oberhauser, Tobias Klatt, Felix A. Gers and Alexander Löser | ISWC 2017 | Conference | http://ceur-ws.org/Vol-1963/paper527.pdf | October 2017
Analysing Errors of Open Information Extraction Systems | Rudolf Schneider, Tom Oberhauser, Tobias Klatt, Felix A. Gers and Alexander Löser | EMNLP 2017 Workshop | Workshop | https://www.aclweb.org/anthology/W17-5402.pdf
Contextual String Embeddings for Sequence Labeling | Alan Akbik, Duncan Blythe and Roland Vollgraf | COLING 2018 | Conference | http://aclweb.org/anthology/C18-1139 | August 2018
All Those Wasted Hours: On Task Abandonment in Crowdsourcing | Lei Han, Kevin Roitero, Ujwal Gadiraju, Cristina Sarasua, Alessandro Checco, Eddy Maddalena and Gianluca Demartini | WSDM 2019 | Conference | https://www.researchgate.net/publication/329238136_All_those_wasted_hours_On_task_abandonment_in_crowdsourcing | February 2019
IDEL: In-Database Neural Entity Linking | Torsten Kilias, Alexander Löser, Felix A. Gers, Richard Koopmanschap, Ying Zhang and Martin Kersten | IEEE BigComp2019 | Conference | http://www.bigcomputing.org/accepted_papers/ | February 2019
RecovDB: accurate and efficient missing values recovery for large time series | Ines Arous, Mourad Khayati, Philippe Cudré-Mauroux, Ying Zhang, Martin Kersten and Svetlin Stalinov | ICDE 2019 | Conference | http://conferences.cis.umac.mo/icde2019/ | April 2019
Pooled Contextualized Embeddings for Named Entity Recognition | Alan Akbik, Tanja Bergmann and Roland Vollgraf | NAACL-HLT 2019 | Conference | https://www.aclweb.org/anthology/N19-1078/ | March 2019
Deadline-Aware Fair Scheduling for Multi-Tenant Crowd-Powered Systems | Djellel Difallah, Alessandro Checco, Gianluca Demartini and Philippe Cudré-Mauroux | Transactions on Social Computing | Journal | https://dl.acm.org/citation.cfm?id=3301003 | January 2019
Implicit Bias in Crowdsourced Knowledge Graphs | Gianluca Demartini | HumBL-WWW2019 | Workshop | May 2019
The Impact of Task Abandonment in Crowdsourcing | Lei Han, Kevin Roitero, Ujwal Gadiraju, Cristina Sarasua, Alessandro Checco, Eddy Maddalena and Gianluca Demartini | IEEE Transactions on Knowledge and Data Engineering (TKDE) | Journal | October 2019
Platform-related Factors in Repeatability and Reproducibility of Crowdsourcing Tasks | Rehab Qarout, Alessandro Checco, Gianluca Demartini and Kalina Bontcheva | HCOMP 2019 | Conference | https://www.humancomputation.com/papers.html | August 2019
Scalable recovery of missing blocks in time series with high and low cross-correlations | Mourad Khayati, Philippe Cudré-Mauroux and Michael H. Böhlen | KAIS 2019 | Journal | http://kais.bigke.org | November 2019
SECTOR: A Neural Model for Coherent Topic Segmentation and Classification | Sebastian Arnold, Rudolf Schneider, Philippe Cudré-Mauroux, Felix A. Gers and Alexander Löser | TACL 2019 | Journal | https://www.mitpressjournals.org/doi/full/10.1162/tacl_a_00261 | July 2019
FLAIR: An Easy-to-Use Framework for State-of-the-Art NLP | Alan Akbik, Tanja Bergmann, Duncan Blythe, Kashif Rasul, Stefan Schweter and Roland Vollgraf | NAACL-HLT 2019 | Conference | https://www.aclweb.org/anthology/N19-4010/ | April 2019
Multilingual Sequence Labeling With One Model | Alan Akbik, Tanja Bergmann and Roland Vollgraf | NLDL 2019 | Workshop | https://alanakbik.github.io/papers/nldl2019.pdf | December 2018
Adversarial Attacks on Crowdsourcing Quality Control | Alessandro Checco, Jo Bates and Gianluca Demartini | Journal of Artificial Intelligence Research (JAIR) | Journal | December 2019
OpenCrowd: Leveraging Open-Ended Answers Aggregation for Finding Social Influencers | Ines Arous, Jie Yang, Mourad Khayati and Philippe Cudré-Mauroux | WWW 2020 | Conference | https://www2020.thewebconf.org | January 2020
Mind the Gap: An Experimental Evaluation of Imputation of Missing Values Techniques in Time Series | Mourad Khayati, Alberto Lerner, Zakhar Tymchenko and Philippe Cudré-Mauroux | VLDB 2020 | Conference | http://www.vldb.org/pvldb/vol13/p768-khayati.pdf | January 2020

Press Releases/Newsletters

Name | Date | Type | Link
datanami | 11/04/2019 | Online news | https://www.datanami.com/this-just-in/monetdb-solutions-appoints-niels-nes-as-cto/
EnterpriseAI news | 10/07/2019 | Online news | https://www.enterpriseai.news/2019/10/07/monetdb-solutions-secures-an-investment-from-servicenow-to-help-large-enterprises-drive-digital-transformation-at-scale/
HiPEAC news | 14/12/2017 | Online news | https://www.hipeac.net/press/6829/ten-winners-selected-for-the-2017-hipeac-tech-transfer-awards/
HiPEAC info 51 | 12/07/2017 | Magazine | https://www.hipeac.net/assets/public/publications/newsletter/hipeacinfo51_final_corrected.pdf
Computer Weekly | 07/07/2017 | Online article | http://www.computerweekly.com/news/450422330/Dutch-database-design-drives-practical-innovation
Handelsblatt | 19/02/2018 | Online article | http://veranstaltungen.handelsblatt.com/kuenstliche-intelligenz/2018/03/03/ki-als-enabler/
Beuth-Magazin | 01/04/2017 | Cover Story | http://www.beuth-hochschule.de/fileadmin/oe/pressestelle/beuth-magazin/2017-1_beuth-magazin.pdf
The University of Sheffield | 01/05/2017 | Online article | https://www.sheffield.ac.uk/faculty/social-sciences/news/fashion-algorithm-future-trends-project-1.671380
The University of Sheffield | 15/11/2017 | Online article | https://www.sheffield.ac.uk/is/research/projects/fashionbrain
Tagesspiegel | | Online news | https://science-match.tagesspiegel.de/digital-future-2018/speakers/alexander-loser
Exasol Magazine | | Online article | https://www.exasol.com/en/blog/interactive-text-mining-exasol-indrex-mm/
KI-Berlin | 01/06/2019 | Online article | https://ki-berlin.de/en/blog/article/prof-dr-alexander-loeser-beuth-university-of-applied-sciences/

Presentations

FashionBrain Project Presentation

Dissemination Material

FashionBrain Factsheet

FashionBrain Project Poster

FashionBrain Project Vision Paper

FashionBrain Project Leaflet

FashionBrain Glossary

Lightning talk at XLDB 2018

FashionBrain description

IDEL In Database Entity Linkage

 

Prototypes and Technologies

Name | Date | Partner(s) | Type | Link | Description
MonetDB with extended windowing functions | April 2018 | MDBS | software | https://www.monetdb.org/Downloads | MonetDB Apr2019 feature release including the extended SQL windowing functions
In-Database Machine Learning | April 2018 | MDBS | software | https://github.com/MonetDB | MonetDB-TensorFlow integration through SQL Python UDFs, which allows executing machine learning tasks inside the kernel of the MonetDB RDBMS
MonetDB continuous query extension | | MDBS | software | https://dev.monetdb.org/hg/MonetDB/shortlog/trails | MonetDB extended with a continuous query processing engine for IoT/Streaming data
MonetDB JSON (renewed) | | MDBS | software | https://dev.monetdb.org/hg/MonetDB/shortlog/json | Renewed support for JSON data loading and processing in MonetDB
RecovDB | August 2018 | UNIFR, MDBS | software | http://revival.exascale.info | Integration of UNIFR's CD-based technology with MonetDB for missing value recovery in time series
Agreement Phi | August 2017 | USFD | software | http://agreement-measure.sheffield.ac.uk/ | Source code and live demo of a novel agreement measure for crowdsourcing
Crowdsourcing logging interface | May 2018 | USFD | API | https://github.com/AlessandroChecco/herokulogging/ | Append-only, ephemeral in-memory logging REST interface. https://fast-logging.herokuapp.com/
Gender bias dataset | February 2018 | USFD | dataset | https://github.com/AlessandroChecco/gender_bias | Dataset used in "Investigating User Bias in Image Search: A Cross-Regional Study". It contains 2,811 query-description comparisons for 281 different users.
Tasty Entity Linkage | June 2018 | Beuth | API | http://demo.datexis.com/tasty/ | Entity Linkage against Wikipedia
Scalable Crowdsourced Social Media Annotation Demo | | Fashwell | API | ./scalable-crowdsourced-social-media-annotation-demo/ |
Product Taxonomy Linking | | Fashwell | API | ./product-taxonomy-linking/ |
Demo on Zalando deep learning powered search engines | | Zalando | software | ./demo-on-zalando-deep-learning-powered-search-engines/ |
Tasty feat. IDEL Demonstration | April 2018 | Beuth | software | ./beuth-tasty-feat-idel-demonstration/ |
BERT Layerwise Analysis | December 2019 | Beuth | software | https://demo.datexis.com/visbert/ | Visualization of BERT, the most important text representation used for text mining in FashionBrain
Flair release 0.4.4 | October 2019 | Zalando | software | https://github.com/zalandoresearch/flair/releases/tag/v0.4.4 | Release 0.4.4 of the popular Flair library

Deliverables

Public Deliverables