Artificial Intelligence and Machine Learning for Foreign Exchange (Fx) Trading Part 3— Lifting the…

This series of articles is dedicated to understanding AI/ML and how it relates to Fx trading. Most articles focus on predicting a price, which is almost useless when it comes to finding profitable trading strategies, so profitable strategies are the focus here.

I have traded Fx for 20 years using traditional statistical and chart analysis, and AI/ML for the last 5 years or so. With a bachelor of engineering, a masters and several certificates in Machine Learning, I wanted to share some of the pitfalls that took me years to learn and explain why it's really difficult to make a system work.

Last article we built on the “hello world” example to get it “in the ball park” and maybe slightly better than guessing. However, it's still pretty useless. The next step is to take a look behind the scenes of Logistic Regression and see what's going on so we can find the gaps.

This is in no way financial advice and does not advocate for any specific trading strategy but instead is designed to help understand some of the details of the Fx market and how to apply ML techniques to it.

Wikipedia has a very official and mathematical definition: “In statistics, the logistic model (or logit model) is a statistical model that models the probability of an event taking place by having the log-odds for the event be a linear combination of one or more independent variables. In regression analysis, logistic regression (or logit regression) is estimating the parameters of a logistic model (the coefficients in the linear combination).”

In plain English, and specific to Fx trading, it creates a model from input variables to produce the probability of an event taking place. We feed in the input variables as well as a binary yes or no recording whether the event happened. It learns from those examples, and we can then give it any new input variables and it predicts the probability of the event happening.
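A minimal sketch of that idea in SciKit Learn, using a handful of made-up numbers rather than the real Fx data, just to show the fit / predict_proba pattern:

#
# Sketch: the basic fit / predict_proba pattern (made-up data, for illustration only)
#
import numpy as np
from sklearn.linear_model import LogisticRegression

# two input variables per sample and a binary "did the event happen" label
x = np.array([[0.651, 0.652], [0.653, 0.655], [0.648, 0.647], [0.655, 0.658]])
y = np.array([0, 1, 0, 1])

model = LogisticRegression()
model.fit(x, y)                                # learn the weights from the examples

new_sample = np.array([[0.654, 0.656]])        # any new input
print(model.predict_proba(new_sample)[0, 1])   # probability that the event happens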

In our example we used the last 4 periods’ close prices as our input variables, and the output was whether the price would move up 200 points (we ignored down or short movements to keep it simple). That gives us the input variables x_t1 (close price now), x_t2 (close price one hour ago), x_t3 (close price 2 hours ago) and x_t4 (close price 3 hours ago), and the output variable y (true/false: is the price more than 200 points higher in 4 hours).

To learn, it uses a formula: a “weight” multiplied by each variable, with the result then wrapped in a sigmoid function. Let's start with the weights.

f(x) = a + (w4 * x_t4) + (w3 * x_t3) + (w2 * x_t2) + (w1 * x_t1)

This is a linear equation (a straight line) whose coefficients the algorithm calculates to best fit the dataset. By default SciKit Learn uses the lbfgs (limited-memory Broyden–Fletcher–Goldfarb–Shanno) solver. It also supports other solvers, but there won't be much difference between them in this scenario.
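For reference, the solver is just a constructor argument if you want to experiment (a quick sketch, the default is fine here):

#
# Sketch: choosing the solver (for illustration only)
#
from sklearn.linear_model import LogisticRegression

lr_default = LogisticRegression()                 # solver='lbfgs' is the default
lr_alt = LogisticRegression(solver='liblinear')   # other options include 'newton-cg', 'sag' and 'saga'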

Once calculated we have a “straight line”, which we now pass through a “sigmoid” function.

[Image: the sigmoid function, an S-shaped curve bounded between 0 and 1]

This takes our straight line and “curves” it, limiting the output to between 0 and 1 and spreading out the predictions. This is important since we are producing a “probability” (which must be between 0 and 1).
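As a tiny worked example (the intercept and weights here are made up purely for illustration), the calculation is just the weighted sum followed by the sigmoid:

#
# Sketch: weighted sum then sigmoid (made-up weights, for illustration only)
#
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

a, w1, w2 = -1.2, 0.8, 1.5               # hypothetical intercept and weights
x_t1, x_t2 = 0.655, 0.652                # made-up close prices

linear = a + (w1 * x_t1) + (w2 * x_t2)   # the "straight line" part
print(linear, sigmoid(linear))           # the sigmoid squashes it into the range (0, 1)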

We will run our example again but with only two variables (two variables are easier to chart than four), look at the weights and chart the decision boundary.

Firstly, let's repeat the data loading from last week with some changes: remove the charting and limit the inputs (or features) to the last 2 periods’ close prices.

#
# IMPORT DATA From github
#

import pandas as pd
from datetime import datetime

url = 'https://raw.githubusercontent.com/the-ml-bull/Hello_World/main/Fx60.csv'
dateparse = lambda x: datetime.strptime(x, '%d/%m/%Y %H:%M')

df = pd.read_csv(url, parse_dates=['date'], date_parser=dateparse)

df.head(n=10)

#
# Create time shifted data as basis for model
#

import numpy as np

df = df[['date', 'audusd_open', 'audusd_close']].copy()

# x is the last 4 values so create x for each
#df['x_t-4'] = df['audusd_close'].shift(4)
#df['x_t-3'] = df['audusd_close'].shift(3)
df['x_t-2'] = df['audusd_close'].shift(2)
df['x_t-1'] = df['audusd_close'].shift(1)

# y is points 4 periods into the future - the open price now (not close)
df['y_future'] = df['audusd_close'].shift(-3)
df['y_change_price'] = df['y_future'] - df['audusd_open']
df['y_change_points'] = df['y_change_price'] * 100000
df['y'] = np.where(df['y_change_points'] >= 200, 1, 0)

#
# Create Train and Val datasets
#
from sklearn.linear_model import LogisticRegression

#x = df[['x_t-4', 'x_t-3', 'x_t-2', 'x_t-1']]
x = df[['x_t-2', 'x_t-1']]
y = df['y']
y_points = df['y_change_points'] # we will use this later

# Note Fx "follows" (time series) so randomization is NOT a good idea
# create train and val datasets.
no_train_samples = int(len(x) * 0.7)
x_train = x[4:no_train_samples]
y_train = y[4:no_train_samples]
y_train_change_points = y_points[4:no_train_samples]

x_val = x[no_train_samples:-3]
y_val = y[no_train_samples:-3]
y_val_change_points = y_points[no_train_samples:-3]

#
# Create class weights
#
from sklearn.utils.class_weight import compute_class_weight

num_ones = np.sum(y_train)
num_zeros = len(y_train) - num_ones
print('In the training set we have 0s {} ({:.2f}%), 1s {} ({:.2f}%)'.format(num_zeros, num_zeros/len(y_train)*100, num_ones, num_ones/len(y_train)*100))

classes = np.unique(y_train)
class_weight = compute_class_weight(class_weight='balanced', classes=classes, y=y_train)
class_weight = dict(zip(classes, class_weight))

print('class weights {}'.format(class_weight))

Next we will manually run the algorithm step by step and see what happens to the weights as we go. In this example we will use 500 data points at a time and “retrain” our algorithm on each block. After each training interval we will:

  • display the new weights of the two input (feature) variables.
  • use the last x and y of the block to manually calculate a prediction and compare it to the SciKit probability prediction.

Note the “warm_start” switch in LogisticRegression tells the library that, when fitting, it should start from the weights it had before rather than starting again from scratch, i.e. build on what we have already learned.
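A stripped-down sketch of what that means, on toy data rather than the Fx data: the second fit picks up from the coefficients the first fit finished with.

#
# Sketch: warm_start continues from the previous fit (toy data, for illustration only)
#
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
x_a, x_b = rng.normal(size=(50, 2)), rng.normal(size=(50, 2))
y_a = (x_a[:, 0] + x_a[:, 1] > 0).astype(int)
y_b = (x_b[:, 0] + x_b[:, 1] > 0).astype(int)

lr = LogisticRegression(warm_start=True)
lr.fit(x_a, y_a)
print('after first block ', lr.coef_)
lr.fit(x_b, y_b)              # starts from the coefficients above rather than from scratch
print('after second block', lr.coef_)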

#
# fit the model (step by step)
#

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

lr = LogisticRegression(warm_start=True)

start_ix = 0
increments = 500
while start_ix < (len(x_train) - increments):

    x = x_train.iloc[start_ix:start_ix+increments].to_numpy()
    y = y_train.iloc[start_ix:start_ix+increments].to_numpy()

    lr.fit(x, y)

    intercept = float(lr.intercept_[0])
    coef_x1 = float(lr.coef_[0, 0])
    coef_x2 = float(lr.coef_[0, 1])
    x1 = float(x[-1, 0])
    x2 = float(x[-1, 1])

    # manually calculate the linear sum for the last sample in the block and compare it
    # (after the sigmoid) to SciKit's probability prediction
    predicted = float(lr.predict_proba(x[-1].reshape(1, 2))[0, 1])
    calculated = intercept + (coef_x1 * x1) + (coef_x2 * x2)

    print('ix: {}, x1: {:.5f}, x2: {:.5f}, y: {} int: {:.5f}, w1: {:.5f}, w2: {:.5f}, Calc: {:.5f}, CalcSig: {:.5f}, Pred: {:.5f}'.format(
        start_ix, x1, x2, y[-1],
        intercept, coef_x1, coef_x2,
        calculated, sigmoid(calculated), predicted))

    start_ix += increments

[Image: output of the step-by-step fit, showing the intercept, weights, manual calculation and SciKit prediction for each block]

You can see that in the first few blocks the intercept and weights change significantly. While they do “settle down” with time, they still move quite a bit (that's a clue we will come back to later). You can also see that our manual calculation using the weights matches the prediction, so we know we are calculating things correctly.

This type of exercise is important to run through as it ensures your understanding and math are correct (no fundamental mistakes) and may yield insights into what's going on (there are some clues here).

Note that if you reduce the block size down from 500 you may get some errors. The lbfgs solver needs at least one sample from each class (a 0 and a 1) to fit, and with few and scattered 1’s that may not happen if the block size is small. One way around it is sketched below.
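A simple guard (a sketch, not part of the original loop) is to check each block before fitting and skip any block that only contains one class. It reuses the x_train, y_train, lr, start_ix and increments variables from the loop above:

#
# Sketch: skip blocks that only contain one class (lbfgs needs both a 0 and a 1)
#
x_block = x_train.iloc[start_ix:start_ix+increments].to_numpy()
y_block = y_train.iloc[start_ix:start_ix+increments].to_numpy()

if len(np.unique(y_block)) < 2:
    pass                       # nothing to fit against in a single-class block, skip it
else:
    lr.fit(x_block, y_block)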

The decision boundary is the line above which samples are classified as a 1 and below which they are classified as a 0. From the above we have two variables (x1, x2), a yes/no outcome y, and a formula to predict the probability of y from x1 and x2. Hence we can chart these to see if they yield any new insights.

Note this is why we limited the inputs to 2. Charting with 3 can get complex, and if you can figure out a good way to chart 4, please let me know. In our final model we will have close to 20 features!

Note we have restructured the code into functions so we can “loop” through a few different iterations and hyperparameters and see what happens.

Firstly, we calculate the model parameters (intercept, weights etc.) and use them to calculate x2 (the graph's y axis) from x1, which is set to its min and max values. The formula used, x2 = -(w1/w2) * x1 - (b/w2), can be derived from the fact that the decision boundary is where the probability is 0.5 (i.e. where the linear sum is zero). It's a little involved, but there are a few articles that do a good job of explaining it; a quick numeric check is sketched after the code below.

# Retrieve the model parameters.

def fit_and_get_parameters(x, y, class_weight):

    lr = LogisticRegression(class_weight=class_weight)
    lr.fit(x, y)

    b = float(lr.intercept_[0])

    # columns of x are ['x_t-2', 'x_t-1']; the chart puts x_t-1 on the x axis (x1)
    # and x_t-2 on the y axis (x2), so map the weights accordingly
    w1 = float(lr.coef_[0, 1])  # weight of x_t-1 (x1 on the chart)
    w2 = float(lr.coef_[0, 0])  # weight of x_t-2 (x2 on the chart)

    # Calculate the intercept and gradient of the decision boundary.
    c = -b / w2
    m = -w1 / w2

    # get the min / max values of x1 and use them to find the decision boundary with x2
    min_x1_value = x['x_t-1'].min()
    max_x1_value = x['x_t-1'].max()
    x1_values = np.array([min_x1_value, max_x1_value])
    x2_values = m * x1_values + c   # i.e. -w1/w2 * x1 - (b/w2)

    print('y = {:.2f} + {:.2f} x1 + {:.2f} x2 Intercept(c): {:.2f}, Gradient(m): {:.3f} x1: {}, x2: {}'.format(b, w1, w2, c, m, x1_values, x2_values))

    return x1_values, x2_values
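As a quick numeric sanity check (a sketch, not part of the article's code, and assuming the x_train, y_train and class_weight prepared above): pick any x1, compute x2 from the boundary formula, and the model should give a probability of about 0.5 at that point.

#
# Sketch: verify the decision boundary formula gives a probability of ~0.5
#
lr = LogisticRegression(class_weight=class_weight)
lr.fit(x_train, y_train)

b = lr.intercept_[0]
w_t2, w_t1 = lr.coef_[0]                             # columns of x_train are ['x_t-2', 'x_t-1']

x1_point = x_train['x_t-1'].mean()                   # any x1 (x_t-1) value will do
x2_point = -(w_t1 / w_t2) * x1_point - (b / w_t2)    # the x2 (x_t-2) value on the boundary

point = pd.DataFrame({'x_t-2': [x2_point], 'x_t-1': [x1_point]})
print(lr.predict_proba(point)[0, 1])                 # should print approximately 0.5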

Then we can feed a “graph” function the data points and the decision boundary start and end to plot.

import matplotlib.pyplot as plt

def plot_decision_boundary(x, y, x1_values, x2_values, heading):

    # put the 0's and 1's in two separate lists for display
    list_0_x1, list_0_x2, list_1_x1, list_1_x2 = [], [], [], []
    for ix in range(len(y)):
        if y.iloc[ix] == 0:
            list_0_x1.append(x['x_t-1'].iloc[ix])
            list_0_x2.append(x['x_t-2'].iloc[ix])
        else:
            list_1_x1.append(x['x_t-1'].iloc[ix])
            list_1_x2.append(x['x_t-2'].iloc[ix])

    # scatterplot the 0's and 1's
    plt.scatter(list_0_x1, list_0_x2, marker='o', color='blue')
    plt.scatter(list_1_x1, list_1_x2, marker='x', color='red')

    # Draw the decision boundary
    plt.plot(x1_values, x2_values, linestyle='-', color='black')

    # axis labels and title
    plt.xlabel('x1')
    plt.ylabel('x2')
    plt.title(heading)
    plt.show()

    return

We then run the simulation:

start_ix, stop_ix = 0, -1
x1_values, x2_values = fit_and_get_parameters(x_train.iloc[start_ix:stop_ix], y_train.iloc[start_ix:stop_ix], class_weight)
plot_decision_boundary(x_train.iloc[start_ix:stop_ix], y_train.iloc[start_ix:stop_ix], x1_values, x2_values, '{} to {} with db {} to {}'.format(start_ix, stop_ix, x1_values, x2_values))

We can run this for a few scenarios of the data with different start and stop values. Some of them are charted below.

[Image: decision boundary charts for several different start/stop ranges of the training data]

Some key points

  • Smaller datasets can result in completely nonsensical results.
  • The values of the data points move over time, from 0.55 up to 0.85 depending upon the range. This is to be expected, since the “price” of Fx changes over time, often over a wide range. This is the main issue with our model at the moment: we need to normalize the data. Next article we are going to do just that.
  • The distribution of 1’s and 0's (red and blue) is almost uniform. Hence the decision boundary looks only slightly better than guessing (it runs right through the middle with roughly equal numbers on both sides).

As it stands our model is still pretty much useless, but we have some insights we can develop.

  • The decision boundary runs through the middle with an almost uniform number of 1’s on both sides (so about as good as guessing).
  • Prices can move a lot, and since our prediction is a weighted sum (i.e. y = w1 * x_t1 etc.), a shift in the level of x_t1 can change the output completely.
  • This model is almost completely useless.

We can start to get closer next article as we explore different types of normalization.


