# Debug

break point and print Advertisements

Skip to content
# Category: Uncategorized

Uncategorized
# Debug

Uncategorized
# Graphical Models

Uncategorized
# Feature Engineering

Uncategorized
# Classification

Uncategorized
# Intern Meeting Logs

Uncategorized
# Apache Hadoop (projects)

Uncategorized
# JAVA Basics

break point and print Advertisements

Bayesian Network Hidden Markov Model Random Field… Kalman Filter Partical Filter

Data Pre-processing(Transformation) Sampling under-sampling, over-sampling, increasing minority samples and decreasing majority samples simultaneously, synthesise “new” samples from the minority class, bootstrap Normalization sigmoid normalization 0-1 normalization ((bla – min(bla)) / ( max(bla) – min(bla) )) z-score Gaussian normalization (Gaussian kernel) Box-cox transformation Feature Engineering image speech text time series: entropy, approximate entropy, sample entropy plus some… Continue reading Feature Engineering

Elements of a model Objective Model structure (e.g. variables, formula, equation, parameters) Model assumptions Parameter estimates and interpretation Model fit (e.g. goodness-of-fit tests and statistics) Model selection LDA Naive Bayes Decision Tree Logistic Regression (one of GLM) Variables: Y: a binary response variable. Yi = 1 if the trait is present in observation (person, unit, etc…) i; Yi… Continue reading Classification

2017.2.17. Type of data collected from Smart Eye tracker: gaze, saccade, fixation, pursuit. We need to smooth the data firstly and calculate the velocity. Smoothing filters: moving-average( Savitzky-Golay filter, polynomial, spatial-exponential) Feature selection: LDA, FDA, mutual information, Minimum Redundancy Maximum Relevance (MRMR), map(SOM) Clustering: variational Bayesian mixture Ideas: build better features, rather than use sophisticated… Continue reading Intern Meeting Logs

QUESTIONS setInputFormat comparator top k frequent words HADOOP SYSTEM Apache Hadoop is an open source software framework for storage and large scale processing of data-sets on clusters of commodity hardware. HDFS(Hadoop distributed file system): data storage (data split and data replication) Map Reduce(data processing): how to leverage job; how do nodes communicate; how to deal with node… Continue reading Apache Hadoop (projects)

Questions about JAVA: static (shared by all objects, owned by class), combined with final (unchanged) 序列化, serializable string builder 正则表达式:regex, //s, //s+ Integer/int, Character/char? singleton Iterator iterable static block this() in constructor gnu trove script language 32-BITs SYSTEM and 64-BITs SYSTEM 232 − 1 = 4294967295 = 4 GiB − 1 32 bits will give… Continue reading JAVA Basics