In daily life, verbal communication often takes place in indoor situations with interfering sounds, where speech intelligibility (SI) is affected by (i) masking and (ii) reverberation. Both introduce spectral and temporal changes to the signal. A critical spatial configuration to assess (binaural) SI is a frontal target speaker and two interfering sources symmetrically placed to either side (± 60°). Here a spatial release from masking (SRM) is observed in comparison to co-located frontal target and interferers, showing that the auditory system can make use of temporally fluctuating interaural differences. Room reverberation affects the temporal representation of the target and maskers and, moreover, the interaural differences depending on the spatial configuration and room acoustical properties. Here the effect of room acoustical properties (room size, T60, frequency dependency of T60), temporal structure of the interferers, and direct to reverberation ratio (DRR) on speech reception thresholds (SRT) and SRM were systematically assessed in a simulated room using headphone-based virtual acoustics. For constant T60 and DRR a different room size resulted in, e.g., significantly different SRTs but similar SRMs, implying the temporal structure of reverberation is less relevant for exploiting binaural cues. Data are discussed and compared to predictions of a binaural SI model.