Explorations of PCAP files from contagio malware dump

Explorations of PCAP files from contagio malware dump

The goal here is to get a brief understanding of the amount of data that we're dealing with after reading all of the log files in. This includes understanding the "how-many-of-each", the "when", and the "what".

ThinkDSP, by Allen Downey (think-dsp.com)

ThinkDSP, by Allen Downey (think-dsp.com)

ThinkDSP, by Allen Downey (think-dsp.com)

Python for Data Science

Python for Data Science

by Gabriel Moreira

Introduction to Scikit-Learn

Introduction to Scikit-Learn

View this IPython Notebook:

It's-a me! Mario!

It's-a me! Mario!

How to Python Like a Boss

How to Python Like a Boss

GitHub Source

BSidesDFW 2014 - Honeypot Howto

BSidesDFW 2014 - Honeypot Howto

This notebok was used to generate graphs and analyze honeypot data for a presentation given at BsidesDFW 2014 by @sooshie (Mike Sconzo) and @theroxyd (Roxy Dehart). Notebook and data analysis performed by Mike Sconzo.

Lesson 1

Lesson 1

Create Data - We begin by creating our own data set for analysis. This prevents the end user reading this tutorial from having to download any files to replicate the results below. We will export this data set to a ...

Timeseries Classification: KNN & DTW

Timeseries Classification: KNN & DTW

Mark Regan

A Gallery of Statistical Graphs in Matplotlib

A Gallery of Statistical Graphs in Matplotlib

Companion notebook to Lecture 3 of Harvard CS109: Data Science . Prepared by Chris Beaumont

4. Doing Naive Bayes Classification

4. Doing Naive Bayes Classification

Full repo here: https://github.com/arnicas/NLP-in-Python

Tricks and Implementations

Tricks and Implementations

Resources

Introduction

Introduction

I like fractals.

Functional Geometry

Functional Geometry

Functional Geometry is a paper by Peter Henderson ( original (1982) , revisited (2002) ) which deconstructs the MC Escher woodcut Square Limit

OpenCV

OpenCV

OpenCV is a popular library in the computer vision community, being actively used in industry and academy. Started in 1999 and popularized in the following decade, OpenCV is covered in books and tutorials, so this text will not provide another ...

Machine Learning with Scikit-Learn: Validation and Model Selection

Machine Learning with Scikit-Learn: Validation and Model Selection

This notebook was put together by Jake Vanderplas for UW's Astro 599 course. Source and license info is on GitHub .

xkcd 1313: Regex Golf (Part 2: Infinite Problems)

xkcd 1313: Regex Golf (Part 2: Infinite Problems)

Peter Norvig with Stefan Pochmann February 2014

Interactive visualisations in the browser with Python

Interactive visualisations in the browser with Python

In the past few years, the python (scientific) ecosystem has seen intense development of solutions aimed at bringing interactive data visualisation in the browser, through a set of libraries which basically interface with powerful JavaScript visualisations libraries such as D3.js ...

Symbolic Methods - Lab 1

Symbolic Methods - Lab 1

Aim: write a generator function to do central finite differencing at arbitrary order, using sympy. Test it on both scripted python and compiled C code.

1 IPython notebook hints and tips

1 IPython notebook hints and tips

This notebook forms part of a series on computational optical radiometry . The notebooks can be downloaded from Github . These notebooks are constantly revised and updated, please revisit from time to time.

Introduction to (Py)Spark

Introduction to (Py)Spark

Apache Spark™ is a fast and general engine for large-scale data processing.

Supervised Learning: Support Vector Machines

Supervised Learning: Support Vector Machines

The support vector machine (SVM) is a classification method that attempts to find a hyperplane that separates classes of observations in feature space .

PCA and Facial Recognition (sklearn edition)

PCA and Facial Recognition (sklearn edition)

This is a Python rendition of principal component analysis in the context of facial recognition using the Extended Yale Faces Database B which you can download here . Originally done in R, this was written in order to experiment with ...

Using Biopython's Bio.Entrez package to Access NCBI's Entrez databases

Using Biopython's Bio.Entrez package to Access NCBI's Entrez databases

This IPython Notebook is based on Chapter 9 of the Biopython Tutorial and Cookbook , "Accessing NCBI’s Entrez databases".

Apache log analysis with Pandas

Apache log analysis with Pandas

koldunov.net

Lasagne Tutorial

Lasagne Tutorial

This tutorial assumes basic knowledge of Theano. Here is a Theano tutorial if you need to get up to speed.

Introduction to (Py)Spark

Introduction to (Py)Spark

Apache Spark™ is a fast and general engine for large-scale data processing.

Data Science with Hadoop - predicting airline delays - part 1

Data Science with Hadoop - predicting airline delays - part 1

In this first blog post we will demonstrate a step by step solution to a supervised learning problem, including:

Instructions for Downloading Images: http://help.brain-map.org/display/api/Downloading+an+Image

Instructions for Downloading Images: http://help.brain-map.org/display/api/Downloading+an+Image

Instructions for Downloading Images: http://help.brain-map.org/display/api/Downloading+an+Image

Key differences between Python 2.7.x and Python 3.x

Key differences between Python 2.7.x and Python 3.x

Sebastian Raschka