Completed in 21st conference, 2021
Virtual Reality is becoming more and more prevalent in many domains. Along with this, VR technology has become increasingly advanced. VR Interfaces, however, still require further research to determine how they impact user experience and presence in virtual environments. In this paper, we explore the time-efficiency and overall “presence” of two UI patterns in a virtual environment, Diegetic and Non-Diegetic UI, for completing tasks related to 3D manipulation in order to determine which is the more effective form of UI for said tasks. We use a between-participants design and ask participants to complete specific tasks using the two UIs, measuring the time taken for each task and then having them take a brief presence questionnaire. Although we were unable to prove interface design has an impact on the effectiveness for 3D manipulation, we gained valuable insight into designing studies for 3D interface designs.
Recommended citation: Choudhry, Abdullah. (2021). "Comparing the Effectiveness of Diegetic and Non-Diegetic Interface Designs for 3D Manipulation Tasks in Virtual Reality" https://openriver.winona.edu/wsurrc/2021/schedule/11/
Completed in 2021
Decades of music exists in the world with no way to find the original multitrack recordings that they are composed of, whether it be a lack of access or just being lost to time. Music source separation is the process of using technology to isolate the individual components of a song after they have been fully arranged and mixed together. These components, or stems, can be the vocals, drums, bass, or any other instrument found in the song. By doing complex analysis on the structure of songs to determine which frequencies or waveform patterns belong to particular sources, a convolutional neural network can be trained to cleanly separate these sources from the full mix. Methods to perform this separation include the use of a spectrogram or a waveform. We examine the existing methods used by Spleeter, Demucs, and f90 Wave-U-Net. In the first phase of the experiment we implement three different U-Net architectures used by these methods using both one-dimensional and two-dimensional convolution. By examining the metrics of SDR (Signal Distortion Ratio), SAR (Signal to Artifacts Ratio) and ISR (Source Image to Spatial Distortion Ratio), we were able to determine the best architecture. From this architecture we move to phase two and implement a secondary U-Net to aid in further refining the results.
Recommended citation: Henning, Ryan; Choudhry, Abdullah; and Ma, Ming (2021) "Deep Learning Based Music Source Separation," SCSU Journal of Student Scholarship: Vol. 1 : Iss. 2 , Article 3." https://repository.stcloudstate.edu/joss/vol1/iss2/3