Chapter 12 Spatial data and maps

12.2 Making maps

ggplot, ggmap, scale bar and north arrow
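
One way the map-making topics listed here might fit together is sketched below. It assumes the sf, ggplot2 and ggspatial packages are installed, and it uses the North Carolina counties shapefile that ships with sf as stand-in data; none of these choices come from the chapter outline. ggmap is left out of the sketch because its basemap tiles need an API key. The scale bar and north arrow come from ggspatial's `annotation_scale()` and `annotation_north_arrow()`.

```r
library(sf)
library(ggplot2)
library(ggspatial)

# Stand-in data (an assumption): county polygons bundled with the sf package.
nc <- st_read(system.file("shape/nc.shp", package = "sf"), quiet = TRUE)

# Project to a metre-based CRS (EPSG:32119, NC State Plane)
# so that distances on the scale bar are meaningful.
nc_proj <- st_transform(nc, 32119)

site_map <- ggplot(nc_proj) +
  geom_sf(fill = "grey90", colour = "grey40") +
  annotation_scale(location = "bl") +                              # scale bar, bottom left
  annotation_north_arrow(location = "tr", which_north = "true") +  # north arrow, top right
  theme_minimal()

site_map
```

Projecting before plotting is a design choice: on unprojected longitude and latitude the ground distance covered by one unit varies across the map, so a single scale bar can mislead.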

12.3 Doing spatial analysis

polygon area, counting points in an area, nearest neighbour
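
The three tasks named here can be sketched in base R with made-up planar coordinates. The rectangle, the four find spots, and the helper names `polygon_area()` and `point_in_polygon()` are all hypothetical, chosen only for illustration. In practice a spatial package such as sf would usually handle these steps, including map projections.

```r
# Hypothetical data: a rectangular survey area and four find spots (planar units).
area_xy <- rbind(c(0, 0), c(4, 0), c(4, 3), c(0, 3))  # polygon vertices, in order
finds   <- rbind(c(1, 1), c(3, 2), c(5, 1), c(2, 2.5))

# Polygon area by the shoelace formula (vertices in order, planar coordinates).
polygon_area <- function(xy) {
  x <- xy[, 1]
  y <- xy[, 2]
  j <- c(2:nrow(xy), 1)  # index of the next vertex, wrapping around
  abs(sum(x * y[j] - x[j] * y)) / 2
}

# Point-in-polygon by ray casting: an odd number of edge crossings
# to the right of the point means the point is inside.
# Points exactly on the boundary are not treated specially.
point_in_polygon <- function(pt, xy) {
  x <- xy[, 1]
  y <- xy[, 2]
  j <- c(2:nrow(xy), 1)
  straddles <- (y > pt[2]) != (y[j] > pt[2])
  x_cross <- x + (pt[2] - y) * (x[j] - x) / (y[j] - y)
  sum(straddles & (pt[1] < x_cross)) %% 2 == 1
}

polygon_area(area_xy)  # 12

# Count the find spots that fall inside the survey area.
inside   <- apply(finds, 1, point_in_polygon, xy = area_xy)
n_inside <- sum(inside)  # 3

# Nearest neighbour of each find spot, from the pairwise distance matrix.
d <- as.matrix(dist(finds))
diag(d) <- Inf                        # ignore each point's distance to itself
nn_index <- apply(d, 1, which.min)    # row number of the nearest other point
nn_dist  <- apply(d, 1, min)          # distance to that point
```

The full distance matrix is fine for a few hundred points, but it grows with the square of the number of points, so large datasets call for a spatial index instead.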
