TCRD v3.1.4 Changes =================== This README describes changes from v3.0.7 DisGeNET -------- - Added columns score and source to target2disease table. - Loaded disease associations derived from Expression Atlas data. - Changed column target2disease.doid to target2disease.did. - Loaded DisGeNET disease associations (curated only). Expression Atlas ---------------- - Added columns log2foldchange and pvalue to target2disease table. - Loaded disease associations derived from Expression Atlas data. UniProt Diseases ---------------- - Moved MIM IDs from target2disease.reference to target2disease.did where datype is 'UniProt Disease'. IDG Phase 2 Flags ----------------- - Added column target.idg2 and populated with boolean flags to indicate IDG Phase 2 targets. Drug Central ------------ - Reloaded drug_activity table and 'DrugCentral Indication' diseases with updated data from Drug Central. - Recomputed and loaded TDLs. PubTator Scores --------------- - created new table ptscore: +------------+---------------+------+-----+---------+----------------+ | Field | Type | Null | Key | Default | Extra | +------------+---------------+------+-----+---------+----------------+ | id | int(11) | NO | PRI | NULL | auto_increment | | protein_id | int(11) | NO | MUL | NULL | | | year | int(4) | NO | | NULL | | | score | decimal(12,6) | NO | | NULL | | +------------+---------------+------+-----+---------+----------------+ - Populated ptscore with PubTator scores. - Added PubTator Score tdl_infos. KEGG Distances -------------- - Calulated and loaded all shortest path length distances between proteins in KEGG Pathways into TCRD table kegg_distance: +----------+---------+------+-----+---------+----------------+ | Field | Type | Null | Key | Default | Extra | +----------+---------+------+-----+---------+----------------+ | id | int(11) | NO | PRI | NULL | auto_increment | | pid1 | int(11) | NO | MUL | NULL | | | pid2 | int(11) | NO | MUL | NULL | | | distance | int(11) | NO | | NULL | | +----------+---------+------+-----+---------+----------------+ - Based on data in kegg_distance, calculated and loaded nearest upstream and downstream Tclin target(s) into TCRD table kegg_nearest_tclin: +------------+-------------------------------+------+-----+---------+----------------+ | Field | Type | Null | Key | Default | Extra | +------------+-------------------------------+------+-----+---------+----------------+ | id | int(11) | NO | PRI | NULL | auto_increment | | protein_id | int(11) | NO | MUL | NULL | | | tclin_id | int(11) | NO | MUL | NULL | | | direction | enum('upstream','downstream') | NO | | NULL | | | distance | int(11) | NO | | NULL | | +------------+-------------------------------+------+-----+---------+----------------+ NB. Only for non-Tclin targets