Akteure
30 Einträge im Katalog
Akteursliste
- Organisation
Snapscreen
Snapscreen (Wien) entwickelt eine intelligente Video-Erkennung für lineares TV und Streaming: Mit Computer-Vision-Fingerprint-Technologie identifiziert die Software live ausgestrahlte TV- und OTT-Inhalte anhand eines Smartphone-Fotos vom Bildschirm. Die Lösung wird als SDK für Br
WienMedien & AV - Organisation
Newsadoo
Newsadoo (Linz) betreibt eine KI-gestützte Nachrichtenplattform, die Inhalte automatisch sammelt, mittels künstlicher Intelligenz versteht und sortiert und sie personalisiert sowie themenspezifisch zugänglich macht. Der personalisierte Newsfeed wird laufend anhand des Leseverhalt
LinzMedien & AV - Organisation
FH St. Pölten – Forschungsgruppe Media Computing
Die Forschungsgruppe Media Computing ist eine institutionelle Forschungsgruppe der Fachhochschule St. Pölten und betreibt Grundlagen- und angewandte Forschung zu interaktiven Multimedia-Systemen. Zu den Forschungsschwerpunkten zählen Computer Vision, Information Visualization, Vi
St. PöltenMedien & AV - Organisation
Cortical.io
Cortical.io (Wien) entwickelt eine Natural-Language-Understanding-Plattform auf Basis seiner „Semantic Folding“-Technologie, die Texte in semantische Fingerabdrücke nach Vorbild der menschlichen Großhirnrinde überführt. Die Software durchsucht, extrahiert, annotiert und klassifiz
WienMedien & AV - Forschung
The Binaural Rendering Toolbox. A Virtual Laboratory for Reproducible Research in Psychoacoustics
The Binaural Rendering Toolbox (BRT) is a set of software libraries, applications, and definitions aimed as a virtual laboratory for psychoacoustic experimentation.The BRT is developed in the framework of the SONICOM project 1 and will include the algorithms developed in the 3D Tune-In Toolkit 2 in a new open, extensible architecture.At the core of the BRT Toolbox, a library provides C++ implementations of listener models, source models, and environment models, including a growing collection of portings to different audio frameworks such as PureData, MaxMSP and VST plugins, by means of the Avendish library.In addition, the BRT also includes an application controlled via the Open Sound Control (OSC) protocol.This paper describes the architecture of the BRT, its main features, and its application to reproducible psychoacoustics experiments.The toolbox provides a complete trace of the experiment, including the delivered binaural audio, annotated with the listener and source movements.For this purpose, a new SOFA convention is proposed to store dynamic measurements, facilitating their use in the Auditory Model Toolbox (AMT).
Medien & AV - Forschung
Deep learning’s shallow gains: a comparative evaluation of algorithms for automatic music generation
Abstract Deep learning methods are recognised as state-of-the-art for many applications of machine learning. Recently, deep learning methods have emerged as a solution to the task of automatic music generation (AMG) using symbolic tokens in a target style, but their superiority over non-deep learning methods has not been demonstrated. Here, we conduct a listening study to comparatively evaluate several music generation systems along six musical dimensions: stylistic success, aesthetic pleasure, repetition or self-reference, melody, harmony, and rhythm. A range of models, both deep learning algorithms and other methods, are used to generate 30-s excerpts in the style of Classical string quartets and classical piano improvisations. Fifty participants with relatively high musical knowledge rate unlabelled samples of computer-generated and human-composed excerpts for the six musical dimensions. We use non-parametric Bayesian hypothesis testing to interpret the results, allowing the possibility of finding meaningful non -differences between systems’ performance. We find that the strongest deep learning method, a reimplemented version of Music Transformer, has equivalent performance to a non-deep learning method, MAIA Markov, demonstrating that to date, deep learning does not outperform other methods for AMG. We also find there still remains a significant gap between any algorithmic method and human-composed excerpts.
Medien & AV