Patorn Utenpattanun patorn

Lecture 1: Introduction to Research — [📝Lecture Notebooks] [▶️Video]
Lecture 2: Introduction to Python — [📝Lecture Notebooks] [▶️Video]
Lecture 3: Introduction to NumPy — [📝Lecture Notebooks] [▶️Video]
Lecture 4: Introduction to pandas — [📝Lecture Notebooks] [▶️Video]
Lecture 5: Plotting Data — [📝Lecture Notebooks] [[▶️Vide

Database Naming Convention and Data Warehouse Design Principles

[TOC]

http://financials.morningstar.com/ajax/exportKR2CSV.html?t=<market>:<stock>

Market

XHKG: Hong Kong Stock Exchange

XASE: American Stock Exchange

XNAS: Nasdaq Stock Exchange >* XNYS: New York Stock Exchange

|
|\_ app
|...
|\_ docker
| |

The Guardian offers an API as deep and robust as the New York Times Article API when it comes to content analysis.

The Guardian's API offers more than "1.7 million pieces of content", with published items as far back as 1999. You can register as a developer here, which gets you 5,000 API hits a day and an API key that looks something like this:

zzzyyyyy-9a9z-999z-z999-9e8a83922516

The Guardian has a handy interactive explorer to interactively tweak the query parameters.

Tested with Apache Spark 2.1.0, Python 2.7.13 and Java 1.8.0_112

For older versions of Spark and ipython, please, see also previous version of text.

	# A simple cheat sheet of Spark Dataframe syntax
	# Current for Spark 1.6.1

	# import statements
	from pyspark.sql import SQLContext
	from pyspark.sql.types import *
	from pyspark.sql.functions import *

	#creating dataframes
	df = sqlContext.createDataFrame([(1, 4), (2, 5), (3, 6)], ["A", "B"]) # from manual data

	"""Parse Salesforce report data in Python

	details in my answer https://stackoverflow.com/a/45645135/448474
	"""
	from collections import OrderedDict
	from simple_salesforce import Salesforce
	import pandas as pd
	import json