Overview
Package
Class
Use
Tree
Deprecated
Index
Help
PREV NEXT
FRAMES
NO FRAMES
All Classes
A
B
C
D
E
F
G
H
I
M
N
O
P
Q
R
S
T
U
W
A
accumulate(Tuple)
- Method in class datafu.pig.linkanalysis.
PageRank
accumulate(Tuple)
- Method in class datafu.pig.sessions.
Sessionize
accumulate(Tuple)
- Method in class datafu.pig.stats.
StreamingQuantile
addEdges(Integer, ArrayList<Map<String, Object>>)
- Method in class datafu.linkanalysis.
PageRank
AliasBagFields
- Class in
datafu.pig.bags
Re-alias the fields inside of a bag.
AliasBagFields(String)
- Constructor for class datafu.pig.bags.
AliasBagFields
all_equal(PriorityQueue<SetIntersect.pair>)
- Method in class datafu.pig.bags.sets.
SetIntersect
AppendToBag
- Class in
datafu.pig.bags
Appends a tuple to a bag.
AppendToBag()
- Constructor for class datafu.pig.bags.
AppendToBag
ASSERT
- Class in
datafu.pig.util
Asserts some boolean.
ASSERT()
- Constructor for class datafu.pig.util.
ASSERT
B
BagConcat
- Class in
datafu.pig.bags
Concatenates the tuples from a set of bags, producing a single bag containing all tuples.
BagConcat()
- Constructor for class datafu.pig.bags.
BagConcat
BagSplit
- Class in
datafu.pig.bags
Splits a bag of tuples into a bag of bags, where the inner bags collectively contain the tuples from the original bag.
BagSplit()
- Constructor for class datafu.pig.bags.
BagSplit
BagSplit(String)
- Constructor for class datafu.pig.bags.
BagSplit
binconf(Long, Long)
- Method in class datafu.pig.stats.
WilsonBinConf
BoolToInt
- Class in
datafu.pig.util
UDF which converts a Boolean to an Integer.
BoolToInt()
- Constructor for class datafu.pig.util.
BoolToInt
C
call(DataBag)
- Method in class datafu.pig.bags.
AliasBagFields
call(DataBag, Tuple)
- Method in class datafu.pig.bags.
AppendToBag
call(DataBag)
- Method in class datafu.pig.bags.
Enumerate
call(DataBag, Tuple)
- Method in class datafu.pig.bags.
PrependToBag
call(DataBag)
- Method in class datafu.pig.date.
TimeCount
call(Double, Double, Double, Double)
- Method in class datafu.pig.geo.
HaversineDistInMiles
call(String)
- Method in class datafu.pig.hash.
MD5
call(String)
- Method in class datafu.pig.hash.
MD5Base64
call(Integer, Integer)
- Method in class datafu.pig.numbers.
RandInt
call(DataBag)
- Method in class datafu.pig.stats.
Quantile
call(DataBag)
- Method in class datafu.pig.stats.
StreamingQuantile
call(Number, Number)
- Method in class datafu.pig.stats.
WilsonBinConf
call(String)
- Method in class datafu.pig.urls.
UserAgentClassify
call(Boolean)
- Method in class datafu.pig.util.
BoolToInt
call(Integer)
- Method in class datafu.pig.util.
IntToBool
cleanup()
- Method in class datafu.pig.linkanalysis.
PageRank
cleanup()
- Method in class datafu.pig.sessions.
Sessionize
cleanup()
- Method in class datafu.pig.stats.
StreamingQuantile
clear()
- Method in class datafu.linkanalysis.
PageRank
commit(PageRank.ProgressIndicator)
- Method in class datafu.linkanalysis.
PageRank
D
datafu.linkanalysis
- package datafu.linkanalysis
datafu.pig.bags
- package datafu.pig.bags
datafu.pig.bags.sets
- package datafu.pig.bags.sets
datafu.pig.date
- package datafu.pig.date
datafu.pig.geo
- package datafu.pig.geo
datafu.pig.hash
- package datafu.pig.hash
datafu.pig.linkanalysis
- package datafu.pig.linkanalysis
datafu.pig.numbers
- package datafu.pig.numbers
datafu.pig.sessions
- package datafu.pig.sessions
datafu.pig.stats
- package datafu.pig.stats
datafu.pig.urls
- package datafu.pig.urls
datafu.pig.util
- package datafu.pig.util
disableDanglingNodeHandling()
- Method in class datafu.linkanalysis.
PageRank
Disables dangling node handling (disabled by default).
disableEdgeDiskCaching()
- Method in class datafu.linkanalysis.
PageRank
Disable disk caching of edges once there are too many (disabled by default).
DistinctBy
- Class in
datafu.pig.bags
Get distinct elements in a bag by a given set of field positions.
DistinctBy(String...)
- Constructor for class datafu.pig.bags.
DistinctBy
distribute(PageRank.ProgressIndicator)
- Method in class datafu.linkanalysis.
PageRank
E
EARTH_RADIUS
- Static variable in class datafu.pig.geo.
HaversineDistInMiles
edgeCount()
- Method in class datafu.linkanalysis.
PageRank
enableDanglingNodeHandling()
- Method in class datafu.linkanalysis.
PageRank
Enables dangling node handling (disabled by default).
enableEdgeDiskCaching()
- Method in class datafu.linkanalysis.
PageRank
Enable disk caching of edges once there are too many (disabled by default).
Enumerate
- Class in
datafu.pig.bags
Enumerate through a bag, replacing each (elem) with (elem, idx).
Enumerate()
- Constructor for class datafu.pig.bags.
Enumerate
Enumerate(String)
- Constructor for class datafu.pig.bags.
Enumerate
Enumerate(String, String)
- Constructor for class datafu.pig.bags.
Enumerate
exec(Tuple)
- Method in class datafu.pig.bags.
BagConcat
exec(Tuple)
- Method in class datafu.pig.bags.
BagSplit
exec(Tuple)
- Method in class datafu.pig.bags.
DistinctBy
exec(Tuple)
- Method in class datafu.pig.bags.
FirstTupleFromBag
exec(Tuple)
- Method in class datafu.pig.bags.
NullToEmptyBag
exec(Tuple)
- Method in class datafu.pig.bags.sets.
SetIntersect
exec(Tuple)
- Method in class datafu.pig.bags.sets.
SetUnion
exec(Tuple)
- Method in class datafu.pig.bags.
UnorderedPairs
exec(Tuple)
- Method in class datafu.pig.linkanalysis.
PageRank
exec(Tuple)
- Method in class datafu.pig.sessions.
Sessionize
exec(Tuple)
- Method in class datafu.pig.stats.
MarkovPairs
exec(Tuple)
- Method in class datafu.pig.util.
ASSERT
exec(Tuple)
- Method in class datafu.pig.util.
SimpleEvalFunc
F
FirstTupleFromBag
- Class in
datafu.pig.bags
FirstTupleFromBag()
- Constructor for class datafu.pig.bags.
FirstTupleFromBag
G
getEdgeCachingThreshold()
- Method in class datafu.linkanalysis.
PageRank
Gets the number of edges past which they will be cached on disk instead of in memory.
getNodeIds()
- Method in class datafu.linkanalysis.
PageRank
getNodeRank(int)
- Method in class datafu.linkanalysis.
PageRank
getQuantilesFromParams(String...)
- Static method in class datafu.pig.stats.
QuantileUtil
getReturnType()
- Method in class datafu.pig.util.
SimpleEvalFunc
getTotalRankChange()
- Method in class datafu.linkanalysis.
PageRank
getValue()
- Method in class datafu.pig.linkanalysis.
PageRank
getValue()
- Method in class datafu.pig.sessions.
Sessionize
getValue()
- Method in class datafu.pig.stats.
StreamingQuantile
H
HaversineDistInMiles
- Class in
datafu.pig.geo
Computes the distance (in miles) between two latitude-longitude pairs using the
Haversine formula
.
HaversineDistInMiles()
- Constructor for class datafu.pig.geo.
HaversineDistInMiles
I
init(PageRank.ProgressIndicator)
- Method in class datafu.linkanalysis.
PageRank
IntToBool
- Class in
datafu.pig.util
UDF which converts an Integer to a Boolean.
IntToBool()
- Constructor for class datafu.pig.util.
IntToBool
isEdgeDiskCachingEnabled()
- Method in class datafu.linkanalysis.
PageRank
Gets whether edge disk caching is enabled.
isUsingEdgeDiskCache()
- Method in class datafu.linkanalysis.
PageRank
Gets whether disk is being used to cache edges.
M
MarkovPairs
- Class in
datafu.pig.stats
Accepts a bag of tuples, with user supplied ordering, and generates pairs that can be used for a Markov chain analysis.
MarkovPairs()
- Constructor for class datafu.pig.stats.
MarkovPairs
MarkovPairs(String)
- Constructor for class datafu.pig.stats.
MarkovPairs
MD5
- Class in
datafu.pig.hash
Computes the MD5 value of a string and outputs it in hex.
MD5()
- Constructor for class datafu.pig.hash.
MD5
MD5Base64
- Class in
datafu.pig.hash
Computes the MD5 value of a string and outputs it in Base64 encoding.
MD5Base64()
- Constructor for class datafu.pig.hash.
MD5Base64
Median
- Class in
datafu.pig.stats
Computes the
median
for a
sorted
input bag, using type R-2 estimation.
Median()
- Constructor for class datafu.pig.stats.
Median
N
nextIteration(PageRank.ProgressIndicator)
- Method in class datafu.linkanalysis.
PageRank
nodeCount()
- Method in class datafu.linkanalysis.
PageRank
NullToEmptyBag
- Class in
datafu.pig.bags
UDF that, if the input is null, returns an empty bag; otherwise, returns the input bag unchanged.
NullToEmptyBag()
- Constructor for class datafu.pig.bags.
NullToEmptyBag
O
outputSchema(Schema)
- Method in class datafu.pig.bags.
AliasBagFields
outputSchema(Schema)
- Method in class datafu.pig.bags.
AppendToBag
outputSchema(Schema)
- Method in class datafu.pig.bags.
BagConcat
outputSchema(Schema)
- Method in class datafu.pig.bags.
BagSplit
outputSchema(Schema)
- Method in class datafu.pig.bags.
DistinctBy
outputSchema(Schema)
- Method in class datafu.pig.bags.
Enumerate
outputSchema(Schema)
- Method in class datafu.pig.bags.
FirstTupleFromBag
outputSchema(Schema)
- Method in class datafu.pig.bags.
NullToEmptyBag
outputSchema(Schema)
- Method in class datafu.pig.bags.
PrependToBag
outputSchema(Schema)
- Method in class datafu.pig.bags.sets.
SetOperationsBase
outputSchema(Schema)
- Method in class datafu.pig.bags.
UnorderedPairs
outputSchema(Schema)
- Method in class datafu.pig.geo.
HaversineDistInMiles
outputSchema(Schema)
- Method in class datafu.pig.linkanalysis.
PageRank
outputSchema(Schema)
- Method in class datafu.pig.numbers.
RandInt
outputSchema(Schema)
- Method in class datafu.pig.sessions.
Sessionize
outputSchema(Schema)
- Method in class datafu.pig.stats.
MarkovPairs
outputSchema(Schema)
- Method in class datafu.pig.stats.
Quantile
outputSchema(Schema)
- Method in class datafu.pig.stats.
StreamingQuantile
outputSchema(Schema)
- Method in class datafu.pig.stats.
WilsonBinConf
P
PageRank
- Class in
datafu.linkanalysis
An implementation of
PageRank
.
PageRank()
- Constructor for class datafu.linkanalysis.
PageRank
PageRank
- Class in
datafu.pig.linkanalysis
A UDF which implements
PageRank
.
PageRank()
- Constructor for class datafu.pig.linkanalysis.
PageRank
PageRank(String...)
- Constructor for class datafu.pig.linkanalysis.
PageRank
PageRank.ProgressIndicator
- Interface in
datafu.linkanalysis
PrependToBag
- Class in
datafu.pig.bags
Prepends a tuple to a bag.
PrependToBag()
- Constructor for class datafu.pig.bags.
PrependToBag
progress()
- Method in interface datafu.linkanalysis.
PageRank.ProgressIndicator
Q
Quantile
- Class in
datafu.pig.stats
Computes
quantiles
for a
sorted
input bag, using type R-2 estimation.
Quantile(String...)
- Constructor for class datafu.pig.stats.
Quantile
QuantileUtil
- Class in
datafu.pig.stats
QuantileUtil()
- Constructor for class datafu.pig.stats.
QuantileUtil
R
RandInt
- Class in
datafu.pig.numbers
Generates a uniformly distributed integer between two bounds
RandInt()
- Constructor for class datafu.pig.numbers.
RandInt
S
Sessionize
- Class in
datafu.pig.sessions
Sessionizes an input stream.
Sessionize(String)
- Constructor for class datafu.pig.sessions.
Sessionize
setEdgeCachingThreshold(long)
- Method in class datafu.linkanalysis.
PageRank
Set the number of edges past which they will be cached on disk instead of in memory.
SetIntersect
- Class in
datafu.pig.bags.sets
Computes the set intersection of two or more bags.
SetIntersect()
- Constructor for class datafu.pig.bags.sets.
SetIntersect
SetOperationsBase
- Class in
datafu.pig.bags.sets
SetOperationsBase()
- Constructor for class datafu.pig.bags.sets.
SetOperationsBase
SetUnion
- Class in
datafu.pig.bags.sets
Computes the set union of two or more bags.
SetUnion()
- Constructor for class datafu.pig.bags.sets.
SetUnion
SimpleEvalFunc
<
T
> - Class in
datafu.pig.util
Uses reflection to makes writing simple wrapper Pig UDFs easier.
SimpleEvalFunc()
- Constructor for class datafu.pig.util.
SimpleEvalFunc
StreamingMedian
- Class in
datafu.pig.stats
Computes the approximate
median
for a (not necessarily sorted) input bag, using the Munro-Paterson algorithm.
StreamingMedian()
- Constructor for class datafu.pig.stats.
StreamingMedian
StreamingQuantile
- Class in
datafu.pig.stats
Computes approximate
quantiles
for a (not necessarily sorted) input bag, using the Munro-Paterson algorithm.
StreamingQuantile(String...)
- Constructor for class datafu.pig.stats.
StreamingQuantile
T
TimeCount
- Class in
datafu.pig.date
Performs a count of events, ignoring events which occur within the same time window.
TimeCount(String)
- Constructor for class datafu.pig.date.
TimeCount
U
UnorderedPairs
- Class in
datafu.pig.bags
Generates pairs of all items in a bag.
UnorderedPairs()
- Constructor for class datafu.pig.bags.
UnorderedPairs
UserAgentClassify
- Class in
datafu.pig.urls
Given a user agent string, this UDF classifies clients to 'mobile' and 'desktop'.
UserAgentClassify()
- Constructor for class datafu.pig.urls.
UserAgentClassify
W
WilsonBinConf
- Class in
datafu.pig.stats
Computes the
Wilsonian binomial proportion confidence interval
WilsonBinConf(double)
- Constructor for class datafu.pig.stats.
WilsonBinConf
WilsonBinConf(String)
- Constructor for class datafu.pig.stats.
WilsonBinConf
A
B
C
D
E
F
G
H
I
M
N
O
P
Q
R
S
T
U
W
Overview
Package
Class
Use
Tree
Deprecated
Index
Help
PREV NEXT
FRAMES
NO FRAMES
All Classes
Matthew Hayes, Sam Shah