EHR records are generated and captured during routine clinical interactions and require a significant amount of pre-processing in order to be transformed to research-ready datasets for statistical analysis. The talk will cover methods and tools for creating and validating disease phenotypes from linked electronic health records for research.