Contact
- emi.tanaka@anu.edu.au
- github.com/emitanaka
- @statsgen
- For more information, please contact me via email.
Skills
- Expert: R, HTML/CSS, LaTeX
- Intermediate: Git/GitHub, Python, Bash, JS
- Languages: English (fluent) and Japanese (conversational)
Interests
- Research: experimental design, mixed models, data visualisation, visual inference, bioinformatics, statistical genetics, selective breeding, software development, statistical workflow
- Non-Research: drawing (but not good at it), reading (manga, manhwa and non-fiction books)
This resume was made with the R package pagedown.
Last updated on 2024-11-14.
Emi Tanaka
Work Experience
Biological Data Science Institute
Biological Data Science Institute
Department of Econometrics and Business Statistics
School of Mathematics and Statistics
School of Mathematics and Applied Statistics
Education
Statistical Methods for Improving Motif Evaluation
Supervisor: Dr. Uri Keich
School of Mathematics and Statistics
The University of Sydney, Sydney, Australia, 2015
Major in Mathematics and Statistics
The University of Sydney, Sydney, Australia, 2010
Publications
- Li, W, Cook, D, Tanaka, E, & VanderPlas, S (2024) A plot is worth a thousand tests: Assessing residual diagnostics with the lineup protocol. Journal of Computational and Graphical Statistics, 1-19. 10.1080/10618600.2024.2344612 Citations: 2.
- Tanaka, E (2023) edibble: An R package to encapsulate elements of experimental designs for better planning, management and workflow. 10.48550/arxiv.2311.09705
- Tanaka, E (2023) Towards a unified language in experimental designs propagated by a software framework. 10.48550/arxiv.2307.11593
- Tanaka, E (2022) Getting the most out of your experimental data with design. Biometric Bulletin, 39 (4), 9-13.
- Tanaka, E & Amaliah, D (2022) Current state and prospects of R-packages for the design of experiments. 10.48550/arxiv.2206.07532 Citations: 1.
- Amaliah, D, Cook, D, Tanaka, E, Hyde, K, & Tierney, N (2022) A journey from wild to textbook data to reproducibly refresh the wages data from the national longitudinal survey of youth database. Journal of Statistics and Data Science Education, 0 (ja), 1-27. 10.1080/26939169.2022.2094300 Citations: 2.
- Tanaka, E, Leung, J, & Cook, D (2022) Commentary on “visualization in operations management research”: Incorporating statistical thinking into visualization practices for decision making in operational management. INFORMS Journal on Data Science 10.1287/ijds.2021.0008 Citations: 1.
- Cook, D, Reid, N, & Tanaka, E (2021) The foundation is available for thinking about data visualization inferentially. Harvard Data Science Review 10.1162/99608f92.8453435d Citations: 10.
- Morota, G, Cheng, H, Cook, D, & Tanaka, E (2021) ASAS-NANP SYMPOSIUM: Prospects for interactive and dynamic graphics in the era of data-rich animal science. Journal of Animal Science, 99 (2) 10.1093/jas/skaa402 Citations: 12.
- Tanaka, E (2020) Simple outlier detection for a multi-environmental field trial. Biometrics, 76 (4), 1374-1382. 10.1111/biom.13216 Citations: 13.
- Tanaka, E & Hui, F (2019) Symbolic formulae for linear mixed models. Statistics and data science, 3-21. 10.1007/978-981-15-1960-4_1 Citations: 3.
- Hui, F, Tanaka, E, & Warton, D (2018) Order selection and sparsity in latent variable models via the ordered factor LASSO. Biometrics, 74 (4), 1311-1319. 10.1111/biom.12888 Citations: 30.
- Norman, A, Taylor, J, Tanaka, E, Telfer, P, Edwards, J, Martinant, J, & Kuchel, H (2017) Increased genomic prediction accuracy in wheat breeding using a large Australian panel. Theoretical and Applied Genetics, 130 (12), 2543-2555. 10.1007/s00122-017-2975-4 Citations: 42.
- Tanaka, E, Ral, J, Li, S, Gaire, R, Cavanagh, C, Cullis, B, & Whan, A (2017) Increased accuracy of starch granule type quantification using mixture distributions. Plant Methods, 13, 107. 10.1186/s13007-017-0259-2 Citations: 9.
- Tanaka, E (2014) Statistical methods for improving motif evaluation. PhD Thesis
- Tanaka, E, Bailey, T, & Keich, U (2014) Improving MEME via a two-tiered significance analysis. Bioinformatics, 30 (14), 1965-1973. 10.1093/bioinformatics/btu163 Citations: 26.
- Liachko, I, Tanaka, E, Cox, K, Chung, S, Yang, L, Seher, A, Hallas, L, Cha, E, Kang, G, Pace, H, Barrow, J, Inada, M, Tye, B, & Keich, U (2011) Novel features of ARS selection in budding yeast Lachancea kluyveri. BMC Genomics, 12, 633. 10.1186/1471-2164-12-633 Citations: 26.
- Tanaka, E, Bailey, T, Grant, C, Noble, W, & Keich, U (2011) Improved similarity scores for comparing motifs. Bioinformatics, 27 (12), 1603-1609. 10.1093/bioinformatics/btr257 Citations: 64.
Citation counts are sourced from Google Scholar as of 2024-11-14.
Software
My software contributions can be found at https://github.com/emitanaka.
- edibble
  An R-package that implements the grammar of experimental design.
  Creator and maintainer. GitHub stars: 216
  Source: https://github.com/emitanaka/edibble
  Documentation: https://edibble.emitanaka.org
- flipbookr
  An R-package that parses code, creates partial code builds, and delivers a code movie.
  Author. GitHub stars: 198
  Source: https://github.com/EvaMaeRey/flipbookr
  Documentation: https://evamaerey.github.io/flipbookr/
- anicon
  An R-package to insert animated icons for R markdown and Shiny apps.
  Creator. GitHub stars: 121
  Source: https://github.com/emitanaka/anicon
- simulate
  An R-package for a parametric simulation framework to generate complex multivariate and multilevel data.
  Creator and maintainer. GitHub stars: 1
  Source: https://github.com/emitanaka/simulate
- nestr
  An R-package to build hierarchical or nested structures.
  Creator and maintainer. GitHub stars: 13
  Source: https://github.com/emitanaka/nestr
  Documentation: https://nestr.emitanaka.org
- monash
  A utility R-package with consolidated tools and templates for staff at Monash University.
  Creator and maintainer. GitHub stars: 22
  Source: https://github.com/numbats/monash
- sugar
  An R-package for instructors to create a Shiny app for a particular unit to review the grades and attendance of students.
  Supervisor. GitHub stars: 2
  Source: https://github.com/numbats/sugar
  Documentation: https://numbats.github.io/sugar/
- xaringan
  An R package for creating slideshows with remark.js through R Markdown.
  Contributor. My contribution is the adaptation of the ninja-theme shown in the Documentation. GitHub stars: 1494
  Source: https://github.com/yihui/xaringan
  Documentation: https://github.com/emitanaka/ninja-theme
- anidb
  An R-package to fetch data from the anime database at AniDB.
  Creator and maintainer. GitHub stars: 1
  Source: https://github.com/emitanaka/anidb
- ggmatplot
  Plot Columns of Two Matrices Against Each Other Using ‘ggplot2’
  Author. GitHub stars: 5
  Source: https://github.com/xuan-liang/ggmatplot
  Documentation: https://xuan-liang.github.io/ggmatplot/
- gghdr
  Plots of highest density regions (HDR) for ggplot2.
  Listed as author but my contribution is small. GitHub stars: 50
  Source: https://github.com/ropenscilabs/gghdr
  Documentation: https://ropenscilabs.github.io/gghdr
- shinycustomloader
  Add a custom loader for R shiny.
  Creator. GitHub stars: 119
  Source: https://github.com/emitanaka/shinycustomloader
- datalegreyar
  Datalegreya, the typeface that melts text and data visualisation, for R markdown.
  Creator. GitHub stars: 45
  Source: https://github.com/emitanaka/datalegreyar
- Tomtom
  A motif comparison tool that is part of the MEME Suite.
  The -incomplete-scores option is an implementation resulting from Tanaka et al. (2011).
  Documentation: https://meme-suite.org/meme/doc/tomtom.html
Talks
Links to the slides (if available) are at https://emitanaka.org/talks/.
The last 10 invited talks are listed below.
Workshops
| Workshop | Date | Location | Host |
|---|---|---|---|
| Effective Data Visualisation with R | 6 Dec 2022 | Online | WOMBAT |
| Data Visualisation with R | 28 Nov 2022 | Inverloch | Australasian Applied Statistics Conference |
| Data Visualisation with R | 21-22 Feb 2021 | Online | methods @ manchester |
| Advanced Data Visualisation with R | 8-9 Dec 2021 | Online | Statistical Society of Australia, Canberra Branch |
| Data Visualisation with R | 6 Dec 2021 | Online | Statistical Society of Australia, NSW Branch |
| Data Visualisation with R | 15-16 Apr 2021 | Online | Statistical Society of Australia, Canberra Branch |
| Data Wrangling with R | 1-2 Dec 2020 | Online | Statistical Society of Australia, NSW Branch |
| Data Visualisation with R | 11-12 Nov 2020 | Online | Statistical Society of Australia, Victoria Branch |
| Data Visualisation with R | 28-29 Jul 2020 | Online | Statistical Society of Australia, Victoria Branch |
| Tidyverse & R Markdown | 1 Dec 2019 | Adelaide | International Biometrics Society, Australasia Region |
| Building R Packages & R Markdown | 19 Nov 2019 | Melbourne | Statistical Society of Australia, Victoria Branch |
| Communicating with Data via R Markdown | 4 Oct 2019 | Sydney | COMBINE (subcommittee of ABACBS) |
| Gaining skills in biostatistical consultancy | 4 Jul 2019 | Melbourne | Statistical Society of Australia, Biostatistics Section |
| Statistical Methods for Omics Assisted Breeding | 12-15 Nov 2018 | Tokyo | University of Tokyo |
Teaching
- ETC5523: Communicating with Data
- ETC5512: Wild-Caught Data
- ETC5521: Exploratory Data Analysis
- STAT3012: Applied Linear Models
- STAT5002: Introduction to Statistics
- STAT906: Experimental Design
Professional Service
Professional memberships
- Statistical Society of Australia
- International Biometrics Society
- American Statistical Association
Awards & Distinctions
Australian Research Council
Australian Research Council
Australian Research Council
R Consortium
Sydney Institute of Agriculture
The University of Sydney
Statistical Society of Australia, NSW Branch