On Friday 11th of September I pitched the medical relation extraction work of my CrowdTruth colleague Anca Dumitrache at the third Amsterdam Data Science: Coffee and Data event. The purpose of this was to get in touch with researchers that have medical datasets that are for instance incomplete or contain errors. With our research, we want to investigate how we can improve the quality of this data. Several other interesting presentations on data science in the medical domain were given at this event, which was hosted on the top floor of the VU University Amsterdam. Together with Merel van Empel, we also presented our latest work on gamification of crowdsourcing for advancing biology using BioCrowd. Feel fee to try out the game and provide us with feedback.
On Monday 31st of August I presented the preliminary results of my work on sound representations during the weekly Artificial Intelligence meeting at the VU University Amsterdam. In this collaboration with Emiel van Miltenburg, a sound corpus is built with annotations on how people perceive these sounds. Sounds can often be interpreted in multiple ways, but tags in sound corpora do not directly relate to the acoustic features of sounds. Because of this limited representation of what can be heard in a sound, the ranking of search results is not optimal. In this research, we use crowdsourcing to build an annotated corpus of sounds from freesound.org with meaningful representations that are perceptually grounded. The presented slides can be seen below or on slideshare.
Seth Teicher of the crowdsourcing platform CrowdFlower has published a blog post about the CrowdTruth project. He summarizes what the implications of the research project are and what’s next for data enrichment and Watson.
Read the full story at the CrowdFlower blog: Crowd’s Collective Wisdom vs. Experts: Who Makes IBM Watson Smarter?