Ghting. In (B) and (C), the colored wheels indicate relative weighting of scores from individual screens (see Figure S1A). Names and dots in red are examples from the TC-NER training category (“TCR”). Please note that, for clarity, even if a candidate scores highly in several scoring schemes, it is typically only indicated once. (D) Biased list of interesting proteins and protein complexes that scored highly. (E) Proteins from the TC-NER training category that scored above the Z score threshold (indicated by red bar) in screens across the multiomic approach (see Figure S1B). See also Figures S2, S3, and S4.TCR ALLTCR1.ALL0.3 4 Point score–1 0 1 Av. aggregate z-score/screenCNumber of proteinsDWeightedHYLS CSB ACIN1 PCF11 STK19 SMU1 HNRNPCL1 P4HB RPRD1A ERCC2 PHF3 YBX1/2 PDIA3 ASCCTCRfor EPZ-5676 mechanism of action information value regarding the known TC-NER factors. The underlying assump-2 -1 0 1 2 3 tion is that unknown factors in the tranAv. aggregate z-score/screen Splicing: HNRNPCL1 scription-related DNA damage response CEB2 E PIE IRA SNW1/SNW1-complex UL2 will often (although obviously not invari SY1 OPS8 SMU1 OPS7B RCC5 RCC1 ably) follow the same pattern in the data NOT8 ACIN1 (Acinus) AF1 AB2 HOC1 as the known TC-NER factors. As ex TF2H4 OLR2L DB1 Chromatin: CEA1 pected, the screens had varying abilities DR61 OLR2H Cohesin complex OLR2G OLR2D OLR2C to capture proteins from this training SETD2 Miscellanous: AF1 DC73 UPT16H SRP1 MeCP1 complex YBX1 and 2 category, with the CSB interactome, dam TR9 CEA2 UPT4H CACYBP OLE2 age-induced ubiquitylation, and RNAi low NOT3 CP HYLS1 (Hydroethalus CNA CNA RCC2 transcription (S)-(-)-Blebbistatin solubility particularly effective in unsyndrome) ARP1 OLR2A OLR2E TMPO (nuclear lamina) HOC2 covering such factors (Figure S1A). OLR2J EO1 TR OLR2B Weighting the individual screening experi UL4A UL4B VSSA RCC8 RCC3 ments according to their performance in TF2H2 TF2H1 PS1 OPS2 this respect and applying it to score all pro RCA1 P53BP1 CEB3 FC1 teins increased the median score of known OT1L DC1 MGN1 LYREF TC-NER protein from 0.17 to 0.41 (Fig UPT5H BE2N PA1 PA3 RCC6 ure 6C; Table S9). PA2 It is important to emphasize that there is no single “correct way” of compiling score lists. However, if a factor scores highly no matter which method is used, this obviously increases confidence. Nevertheless, even factors that only from almost 2,200 proteins scoring in one screen, to only scored highly by one or two methods might still be interesting two genes, RPA1 and ASCC3, scoring in six (Figure 6A; Table and included with high confidence after an assessment of the unS9). Realizing that setting arbitrary Z score threshold for inclusion derlying core data. A non-exhaustive list of high-scoring promight not be ideal, we also ranked candidates based on aggre- teins, which we thought to be of particular interest and of high gate Z scores (Figure 6B; Table S9). None of these approaches confidence, is shown in Figure 6D. take into account the possibility that some screens might be Next, we determined which cellular pathways are enriched in much better at uncovering relevant factors than others. To the list of high-scoring proteins (Table S11). For simplicity, this address this, we created a comprehensive list of “transcription- analysis was performed with the data obtained by weighted repair coupling factors” (Table S10), based on an authoritative scoring (Figure 6C), but similar results were achieved using the rec.Ghting. In (B) and (C), the colored wheels indicate relative weighting of scores from individual screens (see Figure S1A). Names and dots in red are examples from the TC-NER training category (“TCR”). Please note that, for clarity, even if a candidate scores highly in several scoring schemes, it is typically only indicated once. (D) Biased list of interesting proteins and protein complexes that scored highly. (E) Proteins from the TC-NER training category that scored above the Z score threshold (indicated by red bar) in screens across the multiomic approach (see Figure S1B). See also Figures S2, S3, and S4.TCR ALLTCR1.ALL0.3 4 Point score–1 0 1 Av. aggregate z-score/screenCNumber of proteinsDWeightedHYLS CSB ACIN1 PCF11 STK19 SMU1 HNRNPCL1 P4HB RPRD1A ERCC2 PHF3 YBX1/2 PDIA3 ASCCTCRfor information value regarding the known TC-NER factors. The underlying assump-2 -1 0 1 2 3 tion is that unknown factors in the tranAv. aggregate z-score/screen Splicing: HNRNPCL1 scription-related DNA damage response CEB2 E PIE IRA SNW1/SNW1-complex UL2 will often (although obviously not invari SY1 OPS8 SMU1 OPS7B RCC5 RCC1 ably) follow the same pattern in the data NOT8 ACIN1 (Acinus) AF1 AB2 HOC1 as the known TC-NER factors. As ex TF2H4 OLR2L DB1 Chromatin: CEA1 pected, the screens had varying abilities DR61 OLR2H Cohesin complex OLR2G OLR2D OLR2C to capture proteins from this training SETD2 Miscellanous: AF1 DC73 UPT16H SRP1 MeCP1 complex YBX1 and 2 category, with the CSB interactome, dam TR9 CEA2 UPT4H CACYBP OLE2 age-induced ubiquitylation, and RNAi low NOT3 CP HYLS1 (Hydroethalus CNA CNA RCC2 transcription particularly effective in unsyndrome) ARP1 OLR2A OLR2E TMPO (nuclear lamina) HOC2 covering such factors (Figure S1A). OLR2J EO1 TR OLR2B Weighting the individual screening experi UL4A UL4B VSSA RCC8 RCC3 ments according to their performance in TF2H2 TF2H1 PS1 OPS2 this respect and applying it to score all pro RCA1 P53BP1 CEB3 FC1 teins increased the median score of known OT1L DC1 MGN1 LYREF TC-NER protein from 0.17 to 0.41 (Fig UPT5H BE2N PA1 PA3 RCC6 ure 6C; Table S9). PA2 It is important to emphasize that there is no single “correct way” of compiling score lists. However, if a factor scores highly no matter which method is used, this obviously increases confidence. Nevertheless, even factors that only from almost 2,200 proteins scoring in one screen, to only scored highly by one or two methods might still be interesting two genes, RPA1 and ASCC3, scoring in six (Figure 6A; Table and included with high confidence after an assessment of the unS9). Realizing that setting arbitrary Z score threshold for inclusion derlying core data. A non-exhaustive list of high-scoring promight not be ideal, we also ranked candidates based on aggre- teins, which we thought to be of particular interest and of high gate Z scores (Figure 6B; Table S9). None of these approaches confidence, is shown in Figure 6D. take into account the possibility that some screens might be Next, we determined which cellular pathways are enriched in much better at uncovering relevant factors than others. To the list of high-scoring proteins (Table S11). For simplicity, this address this, we created a comprehensive list of “transcription- analysis was performed with the data obtained by weighted repair coupling factors” (Table S10), based on an authoritative scoring (Figure 6C), but similar results were achieved using the rec.