Financial Forecasting and Analysis for Low-Wage Workers

06/14/2018 ∙ by Wenyu Zhang, et al. ∙ ibm cornell university Neighborhood Trust, Inc. 0

Despite the plethora of financial services and products on the market nowadays, there is a lack of such services and products designed especially for the low-wage population. Approximately 30 engage in low-wage work, and many of them lead a paycheck-to-paycheck lifestyle. Financial planning advice needs to explicitly address their financial instability. We propose a system of data mining techniques on transactions data to provide automatic and personalized financial planning advice to low-wage workers. We propose robust methods for accurate prediction of bank account balances and automatic extraction of recurring transactions and unexpected large expenses. We formulate a hybrid method consisting of historical data averaging and a regularized regression framework for the prediction task. To uncover recurring transactions, we use a heuristic approach that capitalizes on transaction descriptions. We also propose a category-wise spending analyzer by grouping users through hierarchical clustering. Our methods achieve higher performance compared to conventional approaches and state-of-the-art predictive methods in real financial transactions data. In collaboration with Neighborhood Trust Financial Partners, the proposed methods will upgrade the functionalities in WageGoal, Neighborhood Trust Financial Partners' web-based application that provides budgeting and cash flow management services to a user base comprising mostly low-income individuals. The proposed methods will therefore have a direct impact on the individuals who are currently connected to the product as well as future potential users.



There are no comments yet.


page 1

page 2

page 3

page 4

This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

1 Background and Motivation

Low-wage workers make up a large portion of the working population, and need the most help in financial planning. The U.S. Bureau of Labor Statistics defines low-wage work according to three hourly-wage levels [13]:

  • ($9.25) to lift family of two above poverty line,

  • ($10.75) to lift family of three above poverty line,

  • ($13.50) to lift family of three to 125% of poverty line,

where the corresponding wages in 2013 are in parenthesis. Approximately 15,000,000 U.S. workers earned up to $9.25/hr and 36,000,000 earned up to $13.50/hr in 2013. These are equivalent to 12.63% and 29.54% of the U.S. working population respectively. To put the wages into perspective, $13.50/hr amounts to a monthly income of just over $2000, while the median rent for one-bedroom apartments across 50 major U.S. cities is already $1200 [9].

Many low-income households live paycheck-to-paycheck, at risk of financial instability in the event of medical, job or other unforseen emergencies when they are forced to take up loans and in the process incur additional charges. Reference [24] reports that the largest U.S. banks charged $11.6 billion in overdraft and insufficient fund fees in 2015, of which a significant portion is attributed to the poor. To make matters worse, the average debit charge triggering such fees is $24, while the typical overdraft fee is $35.

The poor are affected by obstacles including high cost of personalized financial services, lack of suitable banking services [4], and hassle of seeking financial services [21]. We hope to alleviate these obstacles through data-driven solutions. In this paper, we propose a system of data mining techniques to improve automatic and personalized financial planning advice to low-wage workers. We work with anonymized checking, savings, and credit card account transactions data. User identification information such as age, gender, and size of household is not used. We also work in the small data scenario where low-income individuals may not have a long banking history due to the aforementioned obstacles.

This work is motivated by use cases for Neighborhood Trust Financial Partners’ WageGoal app which caters primarily to low-income individuals [23]. Figure 1 contains screenshots illustrating some current functionalities such as giving a cash flow snapshot of the user’s income, bills and expenses, and helping with cash flow management. As illustrated in Figure 2, our main contributions are to propose methods for:

  • Short- and long-term prediction of bank account balances with improved accuracy;

  • Automatic extraction of recurring transactions and unexpected large expenses.

Figure 1: Some current functionalities of WageGoal.
Figure 2: Proposed functionalities for WageGoal.

The functionalities inform users about their possible future spending behavior so that they have enough time to adjust and to save up for emergencies. The main difficulties of these tasks originate from the multifaceted nature of transactions data. A user’s spending depends on individual needs and historical spending, but can also exhibit patterns similar to other users. Moreover, salary, bills and other recurring transactions provide characteristic features of a user’s spending behavior, but these cyclic patterns can be noisy and inconsistent at times. Our methods address these difficulties by both effectively mining a user’s recurring transactions and borrowing strength from the spending patterns of other users. We tested on two real financial transactions datasets, one is a smaller dataset from actual WageGoal users, and the other is a larger publicly available dataset from PKDD’99 Discovery Challenge. Our methods achieve higher performance compared to conventional approaches and state-of-the-art prediction methods on both datasets.

We differentiate the proposed functionalities from those already offered by personal finance apps on the market [10]

. The apps mainly track cash flow and provide simple budgeting tools, such as calculating an average daily “spendable" amount based on a user’s income and saving goal. For low-wage workers whose bank balance can hover dangerously around zero, more accurate estimates and finer-grained financial analysis are necessary.

2 Transactions Data: Overview

The WageGoal app collects users’ transactions through authorized accounts using a third-party service Plaid [25]. Each transaction is associated with an account ID, date, description or merchant name, amount and category. Table 1 shows a sample snippet of the transactions captured. There are a total of 11 categories, namely Bank Fees, Cash Advance, Community, Food and Drink, Healthcare, Interest, Payment, Recreation, Service, Shops, and Travel. Uncategorized transactions are labeled NA. In the WageGoal Dataset, 28% of the transactions are uncategorized. The category labels are provided through Plaid and we do not attempt to address the problem of missing labels in this paper. Daily account balances are retrospectively calculated from current balances.

Date Description $ Category
1 6/22/2016 IKEA 20 Shops
2 6/22/2016 Target 10 NA
1 6/24/2016 Starbucks 15 Food & Drink
3 6/24/2016 Interest -0.01 Interest
3 6/24/2016 Direct Deposit -1000 Transfer
Table 1: WageGoal Dataset: Examples of transactions data.

2.1 Initial Cluster Analysis

The WageGoal dataset consists of 19 users with approximately one year of transactions. Using Dynamic Time Warping (DTW) distance [6, 30], we cluster the overall balance sequences for the last available month by hierarchical agglomerative clustering. Balance sequences are set to zero-mean which does not affect the spending pattern. We use DTW window = 2 to allow for slight time series misalignments.

Setting the number of clusters to five, two clusters consist of one user each. For the three other clusters, we plot their average category-wise spending in Figure 3, which shows a clear distinction between the less and more well-off users. The least well-off (blue) group spent less and also incurred more bank fees compared to the moderate (green) and the most well-off (yellow) groups. The three groups spent similarly in four other categories omitted from the figure.

(a) Transfer (negative spending = income)
(b) Selected categories
Figure 3: WageGoal Dataset: Average category-wise spending for each cluster of users: blue, green and yellow.

Temporal patterns in balance sequences help to distinguish the users’ financial status. Hence, we devise our prediction method to borrow strength from the data of similar users.

3 Related Work

Most works on transactions data make use of RFM (Recency, Frequency and Monetary value) to define and analyze customer value [7, 16, 11]. However, these summary statistics mask too many details for our purpose. Instead, we devise data mining and time series techniques to extract more information from the raw data and also make use of similarities amongst users’ spending behaviors to effectively improve prediction.

3.1 Prediction

Two-part models are popular in modeling household finances, medical expenditure and other nonnegative data [8, 19, 20, 22], and can be formulated to cluster the subjects such that each cluster receives different parameter values. They often use covariates as part of the binary and continuous component representations. Covariates include time-related and class-membership variables, such as gender and employee vs. dependency status [22], which need extra manual effort to construct or are simply not available in anonymized data.

A traditional time series model is the ARMA (autoregressive moving average) which regresses the current variable value on the past [27]. Seasonal ARMA is used on periodic data such as traffic flow [29]

. Since transactions contain multiple seasonal components which may not have fixed periodicities, a plain seasonal ARMA is not suitable. Neural networks are used to model ARMA model residuals

[32] or directly to predict [15], but they require large data for good performance.

In other data-driven time series analysis literature, Taylor and Letham [28] implement a fast regression-based method that includes holiday effects, but models only weekly seasonality and requires handcrafted variables to indicate holiday effects. Approaches reported in [3, 17, 31]

extract similar sequences in historical data and use KNN regression to predict by taking unweighted or weighted averages of the samples immediately after the matched sequences. These may not be robust enough against “anomalies" or spikes not uncommon in expenditure. To induce sparsity on regression coefficients,


uses a spike-and-slab prior. Markov chain Monte Carlo methods estimate the full Bayesian model parameters, but they take significant computation time to iteratively forecast multiple future days, one day at a time.

While the aforementioned methods are suitable for “well-behaved" time series, we incorporate design features suitable for transactions, while working under the framework that no further data annotations are required and data is limited at early stages of user enrollment. We recognize that different bank account types and prediction horizons require different treatment, and hence propose a hybrid approach. One aspect of our approach works on the level of individual transactions and relies on extracted recurring transactions, while the other works from the holistic point-of-view of overall balance, where we extract similar sequences and use a computationally efficient regularized regression scheme to penalize anomalous sequences. Another key innovation is that we align the extracted sequences using landmark transactions before regressing, which increases prediction accuracy.

3.2 Finding recurring transactions

It is common to identify time series periodicities in the frequency domain. A Lomb-Scargle periodogram approach to treat missing values and unevenly-spaced time points in finding periodicity in gene expression patterns is proposed in [14]. Another thread line of research [30, 18]

learns the vector-form representation for multi-variable time-series to use in subsequent downstreaming tasks. There are also methods such as

[12] which address the problem directly in the time domain. Although some of these methods are capable of detecting multiple periodicities, they require the period of each cyclic component to be consistent across time. In transactions, there can be significant jitter in the periodicity for recurring transactions such as payments due to differences in the number of days each month and the presence of holidays. Moreover, just using numerical values is insufficient to identify recurring transactions since there is substantial noise, and this is especially so for recurring transactions with small dollar amounts. Manually constructing and maintaining a complete biller’s list for each user’s recurring transactions is ideal but tedious. Hence in this paper, we propose a procedure that automatically identifies possible recurring transactions that takes into account the inconsistent nature of transactions and better distinguish between transactions through their textual descriptions.

4 Technical Solutions

We propose methods that use transactions data as described in Section 2. The main challenges of working with transactions data versus conventional numerical time series are the presence of:

  • Text description for each transaction;

  • Multiple noisy and inconsistent periodic patterns;

  • Spikes in spending.

4.1 Prediction of account balances

We predict account balances up to 31 days ahead to encompass two semimonthly paydays to give users sufficient time and information to plan their finances. We propose a historical averaging method HistAvg for short-term prediction and accounts with minimal transactions, and a regularized least squares method SubseqLS for long-term prediction of accounts with distinct cyclic patterns. To effectively address the nuances in modeling different account types, we further propose a hybrid method HistAvg-SubseqLS where the first days are predicted by HistAvg and the rest by SubseqLS.

4.1.1 HistAvg

HistAvg predicts daily spending and is adapted from the current implementation in WageGoal. The original version uses a biller’s list for bills, while we use recurring transactions found by our proposed procedure in Section 4.2. Predicting spending using past three months’ transactions follows:

  1. Remove recurring charges and top 10% of transactions;

  2. Calculate daily basic spending as the average amount spent daily according to the remaining transactions;

  3. Estimate future spending as the sum of daily basic spending and any recurring charge on that day.

Top 10% of transactions are excluded since these are typically rare purchases. Finally, the account balance is the sum of the previous day’s balance and the estimated spending.

4.1.2 SubseqLS

We assume that a similar balance history implies a similar future save for some “anomalies”. This motivates us to make use of all available historical data across users for prediction.

SubseqLS predicts one-day ahead by first setting the target account’s balance sequence from the immediate past as the length query vector . It then selects balance sequences that are similar to in its first values from historical balances of all users. Finally, it determines the best weights for the sequences to match . We combine the sequences linearly for simplicity of the model, but more flexible combinations are also possible in principle. Essentially, for account of user , we consider

for some transformation and estimate the coefficients for a good prediction performance. Weights determined in traditional KNN regression methods tend to overfit to the query and are not robust to “anomalies”, so we regularize the estimation to avoid this.

Let be the current date, and be the number of days to predict. We set so that is sufficiently long to capture most recurring transactions. To recap notations, is the query for some account of some user , and is the -day ahead balance to be predicted. Each selected sequence is in length. We use DTW distance with window = 2 to measure sequence similarity to allow slight misalignments, and iteratively find each using the fast search in [26]. We then locally expand or contract the sequences to adjust for temporal variations through function , which aligns each to a template of payday events, since paydays mark the start of cyclic spending patterns. Payday estimation is described in Section 4.2. Denoting the aligned sequence as , the first subsequence with length is used to match , and the second with length is used for prediction. Note that may not be observed at time but is just matched to .

We outline SubseqLS algorithm for one-day ahead prediction below. All sequences mentioned are standardized.

  1. Set query vector consisting of user U, account A’s daily balance from to .

  2. Find sequences of length most similar to in DTW distance in the first values.

  3. Create a template of indicators marking user U’s paycheck deposits into account A between and , with magnitude being the paycheck values. Set to be the function that aligns any sequence to this template by DTW.

  4. Align each such that .

  5. Estimate and nonnegative coefficients which minimize the following objective

    where , and obtain and .

  6. Estimate .

(a) Original sequences
(b) Aligned sequences
Figure 4: Effect of aligning to template of payday landmarks: black solid line is query , colored dotted lines are first values of matched sequences.

Aligning in Step 4 is an essential adjustment for temporal variations of matched sequences. Figure 4 shows how this preserves the exact cyclic pattern of .

We set in Step 2 so that similarity matching is done on a sufficiently long sequence with at least one semimonthly pay period. We let to eliminate sequences in which do not consistently match in Step 5. Matrix also penalizes based on anomalous predictions of each . Weight is 1 if , 5 if and 10 if to emphasize more accurate estimation of the tail of which is closer to the start of prediction.

4.2 Extraction of recurring transactions and unexpected large expenses

We propose a procedure to automatically extract recurring transactions in each account, which include bills and periodic behavior such as salary, recurring money transfer, and grocery shopping. They are split into monthly, semimonthly, biweekly and weekly frequency. For spending category C, for each transaction and frequency, we look for past transactions with similar descriptions satisfying the frequency constraint. Figure 5 shows the procedure for monthly charges. We start with all transactions within a 7-day window, denoted . We backtrack by 31 days and retrieve all transactions in a window that have descriptions similar to those in . We repeat till we obtain 4 windows of transactions to ensure that the remaining transactions identified in the last window indeed have the desired frequency.

Figure 5: Procedure to extract monthly charges.

For monthly and semimonthly charges, we use a 7-day window to accommodate differing month lengths, and use a 2-day window otherwise to accommodate small spending shifts due to holidays, etc. To compare descriptions, we use the Python difflib module [1] to iteratively find the longest contiguous matching subsequences excluding junk elements, with a similarity threshold ratio of to accommodate insignificant differences such as dates and reference numbers.

To predict the next occurrence of a monthly transaction, we estimate the date as the last observed transaction date plus one month, and the amount as the historical average. We do similarly for semimonthly, biweekly and weekly transactions.

We further use the recurring transactions to find unexpected or anomalous large expenses. On each user’s transactions:

  1. Remove all recurring transactions;

  2. Retain unique transactions in remaining top 10%.

The results are pooled across all users, and the list of expenses displayed to a user can be personalized depending on their characteristics (e.g., car owner, a person with children).

5 Empirical Evaluation

5.1 Data Description

Since WageGoal is a relatively new app, the dataset collected is limited in terms of the number of users and the length of usage. We hence include an additional financial dataset from the PKDD’99 Discovery Challenge. We note that the PKDD’99 Dataset is not specific to low-wage workers.

5.1.1 WageGoal Dataset

This dataset is collected from 19 individuals, with approximately one year’s worth of financial transactions from June 21, 2016, to June 16, 2017, in checking, savings, and credit card accounts. There are a total of 52 accounts of which 16 are checking, 19 are savings, and 17 are other accounts including credit cards. Each line item in the data includes date, description, amount, category and final account balance as described in Section 2. The one year’s worth of data is split into a training period of nine months and a testing period of three months. All users have semimonthly income.

5.1.2 PKDD’99 Dataset

This is a publicly available dataset of real anonymized bank transactions from January 1, 1993 to December 31, 1998 [5]. We test here the scenario of long historical data and retain the 2263 accounts with at least four years of data. Accounts have sparse transactions with a maximum of 52.479 per year, so we consider weekly instead of daily balances. Training and testing periods are 4.5 and 1.5 years respectively. Each line item includes date and amount. No information on description and category is provided, and hence this dataset is not used to evaluate any other task besides prediction.

5.2 Prediction of account balances

From the test set, 25 length-31 sequences are randomly chosen for prediction. We compute two different error measures:

  • MAE (Mean Absolute Error), the average absolute difference between true and predicted account balances;

  • Average difference in dollar amounts between true and predicted balances when the former becomes negative.

The point in time at which balances become negative is of special interest because penalty fees will start being charged. We use only the first error measure on the PKDD’99 dataset, the second is not applicable because true account balances in the PKDD’99 dataset are unknown and we arbitrarily set all account balances to start at 0. To calculate the error measures, we scale all accounts to have variance equal to 100 so that they contribute approximately equally.

We compare the performance of the individual methods HistAvg and SubseqLS, the hybrid HistAvg-SubseqLS, as well as Prophet, ARMA, NearestNeighbor and KNN averaging. Prophet is a state-of-the-art forecasting method from Facebook [28] that uses a regression model to fit a linear trend, and incorporates weekly seasonality and holiday effects by marking them through indicator variables. We used paydays in lieu of holiday effects. ARMA is a well-established model for stationary stochastic processes [27], and we implemented ARMA with parameters found through the statsmodels module using default arguments [2]. K-Nearest Neighbor is a popular nonparametric approach that is flexible and applied widely in diverse domains such as traffic flow and energy [3, 31]. In NearestNeighbor, only the top matched sequence to the query is used by directly taking the value immediately after the match as the one-day-ahead prediction, and in KNN averaging, the top 10 matched sequences are used and the one-day-ahead prediction is the average of the values immediately after all 10 matches.

5.2.1 Prediction with WageGoal Dataset

The WageGoal Dataset is split into two types of accounts, those with paycheck income transactions (20 accounts), and those without (32 accounts). The former demonstrates more pronounced cyclic pattern as in Figure 7 and 7.

(a) Example balance sequences
(b) Standard deviations
(a) Example balance sequences
(b) Standard deviations
Figure 6: WageGoal Dataset: Paycheck accounts
Figure 7: WageGoal Dataset: Non-paycheck accounts
Figure 6: WageGoal Dataset: Paycheck accounts

The training set is used to optimize the number of matches and penalty parameter in SubseqLS by cross-validation and also to determine the switching parameter for HistAvg-SubseqLS. Search range for is in multiples of 5 between 5 and 25, values are between 0 and 10, and values are integers between 0 and . For paycheck accounts, and . For non-paycheck accounts, and , meaning HistAvg-SubseqLS reduces to HistAvg. Parameter is determined individually for each account and hence not reported here.

Table 2 shows the test results. HistAvg-SubseqLS almost always performs the best, and is otherwise a close second. Figure 8 plots the average absolute difference between the actual and predicted account balance across time.

Paycheck account balances tend to have pronounced semimonthly patterns starting with a sharp increase at payday followed by a decrease to pre-payday levels. These cyclic patterns sometimes perpetuate through historical data and are shared across users, which make sequence-matching suitable. SubseqLS benefited from regularization in this small dataset because not all top matches were close matches. KNN, in contrast, had average prediction error at least 50 times higher than all other methods. In Figure 7(a), SubseqLS maintained almost consistent error across time, while other methods had higher errors predicting further ahead. Due to the regression formulation, SubseqLS focuses on modeling overall trend instead of next-day prediction. Weights provides some balance, but are difficult to tune. In Figure 7(a), HistAvg had the best short-term predictions as its next-day prediction is designed to be close to the current observation. Switching parameter let HistAvg-SubseqLS use HistAvg for 3 days before switching to SubseqLS, thereby attaining the lowest errors in both short- and long-term.

Non-paycheck accounts are mostly used for savings and occasional purchases. Common transactions include spare change saved through bank programs. The lack of prominent structures in balance sequences resulted in poor matches found for NearestNeighbor, KNN and SubseqLS. As in Figure 7(b), HistAvg performed the best by making conservative predictions. The switching parameter correctly picked to use HistAvg throughout the prediction period, so that HistAvg-SubseqLS shared the same good performance.

Methods MAE Error in amount
HistAvg 11.417 7.460
SubseqLS 7.005 5.774
HistAvg-SubseqLS 6.790 5.099
Prophet 9.534 7.508
ARMA 7.941 6.983
NearestNeighbor 19.126 11.851
KNN 1105.719 88.193
(a) Paycheck accounts
Methods MAE Error in amount
HistAvg 6.876 6.863
SubseqLS 18.474 7.226
HistAvg-SubseqLS 6.876 6.863
Prophet 8.794 11.944
ARMA 6.565 6.981
NearestNeighbor 15.439 6.832
KNN 7212.774 370.220
(b) Non-paycheck accounts
Table 2: WageGoal Dataset: Account balance prediction errors averaged across 25 one-month test samples.
(a) Paycheck accounts
(b) Non-paycheck accounts
Figure 8: WageGoal Dataset: Average account balance prediction error over time across 25 test samples. KNN is excluded due to large magnitude of error.

5.2.2 Prediction with PKDD’99 Dataset

Balances exhibit cyclic patterns as in Figure 9, just like WageGoal paycheck account balances. Since transaction descriptions and categories are not available, we cannot extract any recurring transaction for HistAvg, SubseqLS, and Prophet. We also do not report HistAvg-SubseqLS, since HistAvg, being reliant on good estimates of recurring transactions, is heavily handicapped in this setup and does not benefit the hybrid approach.

(a) Example balance sequences
(b) Standard deviations
Figure 9: PKDD’99 Dataset: Properties of accounts

In each iteration of the experiment, we randomly select 52 accounts to predict. We present test results across eight iterations in Table 3. Figure 10 plots the average absolute difference between the actual and predicted account balance across time. Solid lines are the mean across all iterations, and the shaded regions are the to percentile.

Results mirror those in Section 5.2.1 that SubseqLS performs the best in long-term predictions, showing it is consistent across different scenarios and dataset sizes. Furthermore, in this larger dataset, SubseqLS achieved low errors in short-term predictions as well. Larger data benefit sequence-matching methods since more close matches are available. For instance, KNN’s performance for the PKDD’99 Dataset is also much higher than that for the WageGoal Dataset.

Methods MAE Mean MAE Standard deviation
HistAvg 12.245 0.578
SubseqLS 7.017 0.277
Prophet 7.794 0.311
ARMA 7.967 0.455
NearestNeighbor 9.243 0.674
KNN 8.012 0.751
Table 3: PKDD’99 Dataset: Account balance prediction errors averaged across 25 length-31 test samples.
Figure 10: PKDD’99 Dataset: Average account balance prediction error over time across 25 test samples.

5.3 Extraction of recurring transactions and unexpected large expenses

From the test set of the WageGoal Dataset, 25 dates are randomly picked. For each date, we extract recurring transactions prior to the date, and predict the dates for their next occurrence. A transaction is correctly extracted if the prediction is within 5 days of the true date. We evaluate the quality of the extraction procedure by:

  • Average number of true recurring transactions extracted per user;

  • Precision, i.e., proportion of recurring transactions extracted that is true;

  • Average error in days for the predicted dates at which recurring transactions next occur.

We compare the performance of our proposed procedure with an extraction method utilizing transaction descriptions and category labels. The competing method flags a transaction as recurring if the description contains the word ‘recurring’ or if the category label contains the following keywords: ‘bill pay’, ‘payroll’, ‘service - insurance’ and ‘service - subscription’. These rules are manually formulated based on close inspection of the dataset.

As in Table 4, the proposed method identified a larger number of true recurring transactions and with higher precision. Despite extracting more than double the number of true recurring transactions, the proposed method was only half a day worse on average in predicting the next transaction date. We combine the recurring transactions found by both methods above and use them to obtain a list of unexpected large expenses. Some examples of their approximate costs are shown in Table 5. We provided only a single value for each cost, but given observations of the expense from more users, a range or empirical distribution will be appropriate.

Method # extracted Precision
Error in
Proposed 4.633 0.647 1.465
Using labels 2.161 0.311 1.043
Table 4: WageGoal Dataset: Performance in extraction of recurring transactions across 25 test samples.
Description Cost ($)
House cleaning service 350
Car repair 500
Student loan 5000
Roofing 8000
Table 5: WageGoal Dataset: Examples of unexpected large expenses.

6 Significance and Impact

Our system will upgrade and replace existing models in WageGoal, and therefore have a direct and near-immediate impact on the low-income individuals connected to the product. The enhancements will help users manage their volatile cash flow, capitalize on opportunities for debt reduction and savings, obtain greater overall financial health and stay out of poverty. Strategic partnerships will help Neighborhood Trust further penetrate relevant markets in the coming years, eventually reaching many tens of thousands of clients nation-wide through its technology platforms.

Robust tracking systems are in place to measure the outcomes. Results will be shared as appropriate via Neighborhood Trust’s network of financial empowerment providers and other interested stakeholders.

7 Conclusion

We proposed a system of data mining techniques to predict and analyze spending behaviors. Future work includes improving the predictive models by incorporating additional individual- and group-level information, providing early and enhanced visibility for users into their financial health, and automatically generating personalized recommendations for improving financial stability. Depending on deployment feedback, other improvements will be considered as necessary.

8 Acknowledgments

The authors thank Mary Coker, Camilla Nestor, and Steve Silverstein from Neighborhood Trust, Karan Bhatia from Philosphie, and Saška Mojsilović from IBM Research for their help and support.


  • [1] difflib: Helpers for computing deltas, 2017.
  • [2] StatsModels: Statistics in python, 2017.
  • [3] F. M. Alvarez, A. Troncoso, J. C. Riquelme, and J. S. A. Ruiz. Energy time series forecasting based on pattern sequence similarity. IEEE TKDE, 23(8):1230–1243, Aug 2011.
  • [4] M. S. Barr. Banking the poor: Policies to bring low-income americans into the financial mainstream. In Research Brief. The Brookings Institution, 2004.
  • [5] P. Berka and M. Sochorova. PKDD Discovery Challenge - Guide to the Financial Data Set, 1999.
  • [6] D. J. Berndt and J. Clifford. Using dynamic time warping to find patterns in time series. In KDD, volume 10, pages 359–370. Seattle, WA, 1994.
  • [7] D. Birant. Data mining using RFM analysis. In K. Funatsu, editor, Knowledge-Oriented Applications in Data Mining, chapter 6, pages 91–208. InTech, 2011.
  • [8] S. Brown, P. Ghosh, L. Su, and K. Taylor. Modelling household finances: A Bayesian approach to a multivariate two-part model. Journal of Empirical Finance, 33(C):190–207, 2015.
  • [9] Business Insider. Here’s what the typical one-bedroom apartment costs in 50 US cities, 2016.
  • [10] Business Insider. The 5 best apps to help you manage your money, 2017.
  • [11] H.-C. Chang and H.-P. Tsai. Group RFM analysis as a novel framework to discover better customer consumption behavior. Expert Systems with Applications, 38(12):14499 – 14513, 2011.
  • [12] M. G. Elfeky, W. G. Aref, and A. K. Elmagarmid. Periodicity detection in time series databases. IEEE TKDE, 17(7):875–887, July 2005.
  • [13] V. Fusaro and H. Shaefer. How should we define “low-wage" work? an analysis using the current population survey. Monthly Labor Review, U.S. Bureau of Labor Statistics, 10 2016.
  • [14] E. F. Glynn, J. Chen, and A. R. Mushegian. Detecting periodic patterns in unevenly spaced gene expression time series using lomb–scargle periodograms. Bioinformatics, 22(3):310–316, Feb. 2006.
  • [15] L. Gonzaga Baca Ruiz, M. Cuéllar, M. Calvo-Flores, and M. d. C. Pegalajar Jiménez. An application of non-linear autoregressive neural networks to predict energy consumption in public buildings. Energies, 9:684, 08 2016.
  • [16] M. Khajvand, K. Zolfaghar, S. Ashoori, and S. Alizadeh. Estimating customer lifetime value based on RFM analysis of customer purchase behavior: Case study. Procedia Computer Science, 3:57 – 63, 2011. World Conference on Information Technology.
  • [17] S. L. Scott and H. R. Varian. Predicting the present with Bayesian structural time series. Int. J. Math. Model. Num. Opt., 5:4–23, Jan. 2014.
  • [18] Q. Lei, J. Yi, R. Vaculin, L. Wu, and I. S. Dhillon. Similarity preserving representation learning for time series analysis. arXiv preprint arXiv:1702.03584, 2017.
  • [19] Y. Min and A. Agresti. Modeling nonnegative data with clumping at zero: A survey. Journal of the Iranian Statistical Society, 1, 2002.
  • [20] J. Mullahy. Much ado about two: Reconsidering retransformation and the two-part model in health econometrics. J. Health Economics, 17(3), 1998.
  • [21] S. Mullainathan and E. Shafir. Savings policy and decisionmaking in low-income households. In National Poverty Center Policy Briefs, chapter 24. University of Michigan, 2010.
  • [22] B. Neelon, A. J. O’Malley, and S.-L. T. Normand. A Bayesian two-part latent class model for longitudinal medical expenditure data: Assessing the impact of mental health and substance abuse parity. Biometrics, 67(1):280–289, 2011.
  • [23] Neighborhood Trust Financial Partners. Neighborhood Trust Financial Partners And FlexWage Solutions Announce Partnership to Develop WageGoal, 2016.
  • [24] Pew Charitable Trusts. Consumers need protection from excessive overdraft costs. A brief from The Pew Charitable Trusts, 12 2016.
  • [25] Plaid., 2018.
  • [26] T. Rakthanmanon, B. Campana, A. Mueen, G. Batista, B. Westover, Q. Zhu, J. Zakaria, and E. Keogh. Searching and mining trillions of time series subsequences under dynamic time warping. In 18th ACM SIGKDD, pages 262–270, 2012.
  • [27] R. Shumway and D. Stoffer. Time Series Analysis and Its Applications - With R Examples. Springer, New York, 2 edition, 2006.
  • [28] S. J. Taylor and B. Letham. Forecasting at scale. PeerJ Preprints, 2017.
  • [29] B. Williams and L. A. Hoel. Modeling and forecasting vehicular traffic flow as a seasonal arima process: Theoretical basis and empirical results. Journal of Transportation Engineering, 129:664–672, 11 2003.
  • [30] L. Wu, I. E.-H. Yen, J. Yi, F. Xu, Q. Lei, and M. Witbrock. Random warping series: A random features method for time-series embedding. In

    International Conference on Artificial Intelligence and Statistics

    , pages 793–802, 2018.
  • [31] L. Zhang, Q. Liu, W. Yang, N. Wei, and D. Dong. An improved k-nearest neighbor model for short-term traffic flow prediction. Procedia - Social and Behavioral Sciences, 96(Supplement C):653 – 662, 2013.
  • [32] P. Zhang. Time series forecasting using a hybrid arima and neural network model. neurocomputing. Neurocomputing, 50:159–175, 01 2003.