A B C D E F G H I M N O P Q R S T U W

A

accumulate(Tuple) - Method in class datafu.pig.linkanalysis.PageRank
 
accumulate(Tuple) - Method in class datafu.pig.sessions.Sessionize
 
accumulate(Tuple) - Method in class datafu.pig.stats.StreamingQuantile
 
addEdges(Integer, ArrayList<Map<String, Object>>) - Method in class datafu.linkanalysis.PageRank
 
AliasBagFields - Class in datafu.pig.bags
Re-alias the fields inside of a bag.
AliasBagFields(String) - Constructor for class datafu.pig.bags.AliasBagFields
 
all_equal(PriorityQueue<SetIntersect.pair>) - Method in class datafu.pig.bags.sets.SetIntersect
 
AppendToBag - Class in datafu.pig.bags
Appends a tuple to a bag.
AppendToBag() - Constructor for class datafu.pig.bags.AppendToBag
 
ASSERT - Class in datafu.pig.util
Asserts some boolean.
ASSERT() - Constructor for class datafu.pig.util.ASSERT
 

B

BagConcat - Class in datafu.pig.bags
Concatenates the tuples from a set of bags, producing a single bag containing all tuples.
BagConcat() - Constructor for class datafu.pig.bags.BagConcat
 
BagSplit - Class in datafu.pig.bags
Splits a bag of tuples into a bag of bags, where the inner bags collectively contain the tuples from the original bag.
BagSplit() - Constructor for class datafu.pig.bags.BagSplit
 
BagSplit(String) - Constructor for class datafu.pig.bags.BagSplit
 
binconf(Long, Long) - Method in class datafu.pig.stats.WilsonBinConf
 
BoolToInt - Class in datafu.pig.util
UDF which converts a Boolean to an Integer.
BoolToInt() - Constructor for class datafu.pig.util.BoolToInt
 

C

call(DataBag) - Method in class datafu.pig.bags.AliasBagFields
 
call(DataBag, Tuple) - Method in class datafu.pig.bags.AppendToBag
 
call(DataBag) - Method in class datafu.pig.bags.Enumerate
 
call(DataBag, Tuple) - Method in class datafu.pig.bags.PrependToBag
 
call(DataBag) - Method in class datafu.pig.date.TimeCount
 
call(Double, Double, Double, Double) - Method in class datafu.pig.geo.HaversineDistInMiles
 
call(String) - Method in class datafu.pig.hash.MD5
 
call(String) - Method in class datafu.pig.hash.MD5Base64
 
call(Integer, Integer) - Method in class datafu.pig.numbers.RandInt
 
call(DataBag) - Method in class datafu.pig.stats.Quantile
 
call(DataBag) - Method in class datafu.pig.stats.StreamingQuantile
 
call(Number, Number) - Method in class datafu.pig.stats.WilsonBinConf
 
call(String) - Method in class datafu.pig.urls.UserAgentClassify
 
call(Boolean) - Method in class datafu.pig.util.BoolToInt
 
call(Integer) - Method in class datafu.pig.util.IntToBool
 
cleanup() - Method in class datafu.pig.linkanalysis.PageRank
 
cleanup() - Method in class datafu.pig.sessions.Sessionize
 
cleanup() - Method in class datafu.pig.stats.StreamingQuantile
 
clear() - Method in class datafu.linkanalysis.PageRank
 
commit(PageRank.ProgressIndicator) - Method in class datafu.linkanalysis.PageRank
 

D

datafu.linkanalysis - package datafu.linkanalysis
 
datafu.pig.bags - package datafu.pig.bags
 
datafu.pig.bags.sets - package datafu.pig.bags.sets
 
datafu.pig.date - package datafu.pig.date
 
datafu.pig.geo - package datafu.pig.geo
 
datafu.pig.hash - package datafu.pig.hash
 
datafu.pig.linkanalysis - package datafu.pig.linkanalysis
 
datafu.pig.numbers - package datafu.pig.numbers
 
datafu.pig.sessions - package datafu.pig.sessions
 
datafu.pig.stats - package datafu.pig.stats
 
datafu.pig.urls - package datafu.pig.urls
 
datafu.pig.util - package datafu.pig.util
 
disableDanglingNodeHandling() - Method in class datafu.linkanalysis.PageRank
Disables dangling node handling (disabled by default).
disableEdgeDiskCaching() - Method in class datafu.linkanalysis.PageRank
Disable disk caching of edges once there are too many (disabled by default).
DistinctBy - Class in datafu.pig.bags
Get distinct elements in a bag by a given set of field positions.
DistinctBy(String...) - Constructor for class datafu.pig.bags.DistinctBy
 
distribute(PageRank.ProgressIndicator) - Method in class datafu.linkanalysis.PageRank
 

E

EARTH_RADIUS - Static variable in class datafu.pig.geo.HaversineDistInMiles
 
edgeCount() - Method in class datafu.linkanalysis.PageRank
 
enableDanglingNodeHandling() - Method in class datafu.linkanalysis.PageRank
Enables dangling node handling (disabled by default).
enableEdgeDiskCaching() - Method in class datafu.linkanalysis.PageRank
Enable disk caching of edges once there are too many (disabled by default).
Enumerate - Class in datafu.pig.bags
Enumerate through a bag, replacing each (elem) with (elem, idx).
Enumerate() - Constructor for class datafu.pig.bags.Enumerate
 
Enumerate(String) - Constructor for class datafu.pig.bags.Enumerate
 
Enumerate(String, String) - Constructor for class datafu.pig.bags.Enumerate
 
exec(Tuple) - Method in class datafu.pig.bags.BagConcat
 
exec(Tuple) - Method in class datafu.pig.bags.BagSplit
 
exec(Tuple) - Method in class datafu.pig.bags.DistinctBy
 
exec(Tuple) - Method in class datafu.pig.bags.FirstTupleFromBag
 
exec(Tuple) - Method in class datafu.pig.bags.NullToEmptyBag
 
exec(Tuple) - Method in class datafu.pig.bags.sets.SetIntersect
 
exec(Tuple) - Method in class datafu.pig.bags.sets.SetUnion
 
exec(Tuple) - Method in class datafu.pig.bags.UnorderedPairs
 
exec(Tuple) - Method in class datafu.pig.linkanalysis.PageRank
 
exec(Tuple) - Method in class datafu.pig.sessions.Sessionize
 
exec(Tuple) - Method in class datafu.pig.stats.MarkovPairs
 
exec(Tuple) - Method in class datafu.pig.util.ASSERT
 
exec(Tuple) - Method in class datafu.pig.util.SimpleEvalFunc
 

F

FirstTupleFromBag - Class in datafu.pig.bags
 
FirstTupleFromBag() - Constructor for class datafu.pig.bags.FirstTupleFromBag
 

G

getEdgeCachingThreshold() - Method in class datafu.linkanalysis.PageRank
Gets the number of edges past which they will be cached on disk instead of in memory.
getNodeIds() - Method in class datafu.linkanalysis.PageRank
 
getNodeRank(int) - Method in class datafu.linkanalysis.PageRank
 
getQuantilesFromParams(String...) - Static method in class datafu.pig.stats.QuantileUtil
 
getReturnType() - Method in class datafu.pig.util.SimpleEvalFunc
 
getTotalRankChange() - Method in class datafu.linkanalysis.PageRank
 
getValue() - Method in class datafu.pig.linkanalysis.PageRank
 
getValue() - Method in class datafu.pig.sessions.Sessionize
 
getValue() - Method in class datafu.pig.stats.StreamingQuantile
 

H

HaversineDistInMiles - Class in datafu.pig.geo
Computes the distance (in miles) between two latitude-longitude pairs using the Haversine formula.
HaversineDistInMiles() - Constructor for class datafu.pig.geo.HaversineDistInMiles
 

I

init(PageRank.ProgressIndicator) - Method in class datafu.linkanalysis.PageRank
 
IntToBool - Class in datafu.pig.util
UDF which converts an Integer to a Boolean.
IntToBool() - Constructor for class datafu.pig.util.IntToBool
 
isEdgeDiskCachingEnabled() - Method in class datafu.linkanalysis.PageRank
Gets whether edge disk caching is enabled.
isUsingEdgeDiskCache() - Method in class datafu.linkanalysis.PageRank
Gets whether disk is being used to cache edges.

M

MarkovPairs - Class in datafu.pig.stats
Accepts a bag of tuples, with user supplied ordering, and generates pairs that can be used for a Markov chain analysis.
MarkovPairs() - Constructor for class datafu.pig.stats.MarkovPairs
 
MarkovPairs(String) - Constructor for class datafu.pig.stats.MarkovPairs
 
MD5 - Class in datafu.pig.hash
Computes the MD5 value of a string and outputs it in hex.
MD5() - Constructor for class datafu.pig.hash.MD5
 
MD5Base64 - Class in datafu.pig.hash
Computes the MD5 value of a string and outputs it in Base64 encoding.
MD5Base64() - Constructor for class datafu.pig.hash.MD5Base64
 
Median - Class in datafu.pig.stats
Computes the median for a sorted input bag, using type R-2 estimation.
Median() - Constructor for class datafu.pig.stats.Median
 

N

nextIteration(PageRank.ProgressIndicator) - Method in class datafu.linkanalysis.PageRank
 
nodeCount() - Method in class datafu.linkanalysis.PageRank
 
NullToEmptyBag - Class in datafu.pig.bags
UDF that, if the input is null, returns an empty bag; otherwise, returns the input bag unchanged.
NullToEmptyBag() - Constructor for class datafu.pig.bags.NullToEmptyBag
 

O

outputSchema(Schema) - Method in class datafu.pig.bags.AliasBagFields
 
outputSchema(Schema) - Method in class datafu.pig.bags.AppendToBag
 
outputSchema(Schema) - Method in class datafu.pig.bags.BagConcat
 
outputSchema(Schema) - Method in class datafu.pig.bags.BagSplit
 
outputSchema(Schema) - Method in class datafu.pig.bags.DistinctBy
 
outputSchema(Schema) - Method in class datafu.pig.bags.Enumerate
 
outputSchema(Schema) - Method in class datafu.pig.bags.FirstTupleFromBag
 
outputSchema(Schema) - Method in class datafu.pig.bags.NullToEmptyBag
 
outputSchema(Schema) - Method in class datafu.pig.bags.PrependToBag
 
outputSchema(Schema) - Method in class datafu.pig.bags.sets.SetOperationsBase
 
outputSchema(Schema) - Method in class datafu.pig.bags.UnorderedPairs
 
outputSchema(Schema) - Method in class datafu.pig.geo.HaversineDistInMiles
 
outputSchema(Schema) - Method in class datafu.pig.linkanalysis.PageRank
 
outputSchema(Schema) - Method in class datafu.pig.numbers.RandInt
 
outputSchema(Schema) - Method in class datafu.pig.sessions.Sessionize
 
outputSchema(Schema) - Method in class datafu.pig.stats.MarkovPairs
 
outputSchema(Schema) - Method in class datafu.pig.stats.Quantile
 
outputSchema(Schema) - Method in class datafu.pig.stats.StreamingQuantile
 
outputSchema(Schema) - Method in class datafu.pig.stats.WilsonBinConf
 

P

PageRank - Class in datafu.linkanalysis
An implementation of PageRank.
PageRank() - Constructor for class datafu.linkanalysis.PageRank
 
PageRank - Class in datafu.pig.linkanalysis
A UDF which implements PageRank.
PageRank() - Constructor for class datafu.pig.linkanalysis.PageRank
 
PageRank(String...) - Constructor for class datafu.pig.linkanalysis.PageRank
 
PageRank.ProgressIndicator - Interface in datafu.linkanalysis
 
PrependToBag - Class in datafu.pig.bags
Prepends a tuple to a bag.
PrependToBag() - Constructor for class datafu.pig.bags.PrependToBag
 
progress() - Method in interface datafu.linkanalysis.PageRank.ProgressIndicator
 

Q

Quantile - Class in datafu.pig.stats
Computes quantiles for a sorted input bag, using type R-2 estimation.
Quantile(String...) - Constructor for class datafu.pig.stats.Quantile
 
QuantileUtil - Class in datafu.pig.stats
 
QuantileUtil() - Constructor for class datafu.pig.stats.QuantileUtil
 

R

RandInt - Class in datafu.pig.numbers
Generates a uniformly distributed integer between two bounds.
RandInt() - Constructor for class datafu.pig.numbers.RandInt
 

S

Sessionize - Class in datafu.pig.sessions
Sessionizes an input stream.
Sessionize(String) - Constructor for class datafu.pig.sessions.Sessionize
 
setEdgeCachingThreshold(long) - Method in class datafu.linkanalysis.PageRank
Set the number of edges past which they will be cached on disk instead of in memory.
SetIntersect - Class in datafu.pig.bags.sets
Computes the set intersection of two or more bags.
SetIntersect() - Constructor for class datafu.pig.bags.sets.SetIntersect
 
SetOperationsBase - Class in datafu.pig.bags.sets
 
SetOperationsBase() - Constructor for class datafu.pig.bags.sets.SetOperationsBase
 
SetUnion - Class in datafu.pig.bags.sets
Computes the set union of two or more bags.
SetUnion() - Constructor for class datafu.pig.bags.sets.SetUnion
 
SimpleEvalFunc<T> - Class in datafu.pig.util
Uses reflection to make writing simple wrapper Pig UDFs easier.
SimpleEvalFunc() - Constructor for class datafu.pig.util.SimpleEvalFunc
 
StreamingMedian - Class in datafu.pig.stats
Computes the approximate median for a (not necessarily sorted) input bag, using the Munro-Paterson algorithm.
StreamingMedian() - Constructor for class datafu.pig.stats.StreamingMedian
 
StreamingQuantile - Class in datafu.pig.stats
Computes approximate quantiles for a (not necessarily sorted) input bag, using the Munro-Paterson algorithm.
StreamingQuantile(String...) - Constructor for class datafu.pig.stats.StreamingQuantile
 

T

TimeCount - Class in datafu.pig.date
Performs a count of events, ignoring events which occur within the same time window.
TimeCount(String) - Constructor for class datafu.pig.date.TimeCount
 

U

UnorderedPairs - Class in datafu.pig.bags
Generates pairs of all items in a bag.
UnorderedPairs() - Constructor for class datafu.pig.bags.UnorderedPairs
 
UserAgentClassify - Class in datafu.pig.urls
Given a user agent string, this UDF classifies the client as 'mobile' or 'desktop'.
UserAgentClassify() - Constructor for class datafu.pig.urls.UserAgentClassify
 

W

WilsonBinConf - Class in datafu.pig.stats
Computes the Wilson binomial proportion confidence interval.
WilsonBinConf(double) - Constructor for class datafu.pig.stats.WilsonBinConf
 
WilsonBinConf(String) - Constructor for class datafu.pig.stats.WilsonBinConf
 
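
The interval computed by WilsonBinConf is commonly known as the Wilson score interval. A sketch of the textbook formula follows. It takes the normal critical value z directly (for example 1.96 for a 95% interval); how the library derives z from its constructor argument is not shown in this index, and the class and method names here are hypothetical.

```java
// Hypothetical stand-alone sketch of the Wilson score interval; not the DataFu class.
final class WilsonIntervalSketch {
    // Returns {lower, upper} for `successes` out of `trials` at critical value z.
    static double[] interval(long successes, long trials, double z) {
        if (trials <= 0 || successes < 0 || successes > trials) {
            throw new IllegalArgumentException("need trials > 0 and 0 <= successes <= trials");
        }
        double n = trials;
        double p = successes / n;   // observed proportion
        double z2 = z * z;
        double denom = 1 + z2 / n;
        double center = (p + z2 / (2 * n)) / denom;
        double half = z * Math.sqrt(p * (1 - p) / n + z2 / (4 * n * n)) / denom;
        return new double[] { center - half, center + half };
    }
}
```

Unlike the simple normal approximation, this interval stays within [0, 1] and remains usable when the observed proportion is 0 or 1.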

Matthew Hayes, Sam Shah