A Step by Step Guide to Market Sentiment Analysis – Engineering Fear and Greed Index (Part 2)
In the first part of this article, we examined the concept of the Fear and Greed Index, its mathematical foundation, and the data collection process using Sectors Financial API. We identified eight indices—comprising both direct stock market indicators and indirect macroeconomic factors—that contribute to the overall sentiment index for the Indonesian stock market. With the data now organized into dataframes, we are ready to proceed with calculating each individual index. In this article, we will compute these indices and explore the final model for the overall Fear and Greed Index using a weighted average approach. Note that calculations involving external data sources are based on saved CSV files; the backend data processing pipeline will be covered in a separate article.
Market Momentum Index – IDX Composite Momentum
The Market Momentum Index measures the strength and direction of recent price movements in a stock market, indicating whether the market is trending upward or downward. It’s typically calculated using the rate of change in an index or stock prices over a set period, such as the difference in the IDX Composite over the past week or month. In the context of the Fear and Greed Index, high market momentum reflects investor optimism or “greed,” while negative or weak momentum signals caution or “fear” among investors, as it suggests a potential market downturn.
# Create a copy from API dataframe result generated from Sectors Financial API
daily_data_df_momentum = df_history_idx30.copy()
# Set up SMA period and min max scaling range
sma_period=7
min_momentum=-5
max_momentum=5
# Calculate the percentage difference from the SMA & handle zero division error
epsilon = 1e-9
daily_data_df_momentum['momentum_sma'] = ((daily_data_df_momentum['close'] - daily_data_df_momentum['sma']) / (daily_data_df_momentum['sma'] + epsilon)) * 100
# Remove any NaNs in case any
daily_data_df_momentum = daily_data_df_momentum.dropna(subset=['momentum_sma'])
# Scale the momentum to a 0-100 range for the Fear and Greed Index
def scale_to_100(value, min_val, max_val):
scaled_value = (value - min_val) / (max_val - min_val) * 100
return np.clip(scaled_value, 0, 100)
# Generate scaled market momentum index
daily_data_df_momentum['momentum_scaled'] = daily_data_df_momentum['momentum_sma'].apply(lambda x: scale_to_100(x, min_momentum, max_momentum))
# Calculate the market momentum Index on a daily basis
daily_momentum_index = daily_data_df_momentum.groupby('date')['momentum_scaled'].mean().reset_index()
# Print today's market momentum fear and greed index
print('Today´s Market Momentum Fear and Greed index is: ', daily_momentum_index.iloc[-1,-1])
From the code above we will have the result of daily market momentum fear and greed index and we can easily access today’s market momentum fear and greed index through the iloc function.
date momentum_scaled
0 2024-08-07 50.000000
1 2024-08-08 49.692673
2 2024-08-09 53.665698
3 2024-08-12 60.751728
4 2024-08-13 63.810015
... ... ...
58 2024-10-29 30.791101
59 2024-10-30 31.420214
60 2024-10-31 34.289018
61 2024-11-01 27.821455
62 2024-11-04 31.211632
We can plot the results using 50 as a threshold: values below 50 indicate fear, while values above 50 signify greed. Here we will show the visualization using market momentum index result as an example. The techniques to plot rest of the index is the same.
import matplotlib.pyplot as plt
import matplotlib.dates as mdates
fig, ax = plt.subplots(figsize=(12, 6))
ax.plot(daily_momentum_index['date'], daily_momentum_index['momentum_scaled'], color='blue', linewidth=2, label='Market Momentum Index')
# Highlight the "Fear" and "Greed" zones
ax.fill_between(daily_momentum_index['date'], 0, 50, color='red', alpha=0.2, label='Fear Zone')
ax.fill_between(daily_momentum_index['date'], 50, 100, color='green', alpha=0.2, label='Greed Zone')
# Formatting the x-axis
ax.xaxis.set_major_formatter(mdates.DateFormatter('%b %d'))
ax.xaxis.set_major_locator(mdates.WeekdayLocator(interval=1))
plt.xticks(rotation=45)
# Adding labels and title
plt.title("Market Momentum: Fear and Greed Index Over Time", fontsize=16, fontweight='bold')
plt.xlabel("Date", fontsize=12)
plt.ylabel("Momentum Scaled Index", fontsize=12)
plt.grid(visible=True, linestyle='--', alpha=0.6)
plt.legend()
plt.tight_layout()
plt.show()
Stock Price Strength
The Stock Price Strength Index reflects the proportion of stocks trading above their moving averages, such as the 50-day or 200-day average, indicating the overall health of the market. It’s calculated by dividing the number of stocks above their moving average by the total number of stocks in the index. In the context of the Fear and Greed Index, a high price strength suggests optimism or “greed” as more stocks are performing well, while a low value indicates “fear,” showing that a majority of stocks are trending downward.
# Create a copy of the original DataFrame
df_copy_ps = df_history_idx30.copy()
# Calculate the daily percentage change in 'close' price for each symbol on the copy
df_copy_ps['daily_return'] = df_copy_ps.groupby('symbol')['close'].pct_change() * 100
# Calculate the 7-day SMA of the daily returns on the copy
df_copy_ps['sma_7d_return'] = df_copy_ps.groupby('symbol')['daily_return'].transform(lambda x: x.rolling(window=7).mean())
# Handle zero division by replacing any infinite or NaN values in 'sma_7d_return' with zeros
df_copy_ps['sma_7d_return'].replace([np.inf, -np.inf], np.nan, inplace=True)
df_copy_ps['sma_7d_return'].fillna(0, inplace=True)
# Calculate the min and max of the SMA to normalize
min_sma = df_copy_ps['sma_7d_return'].min()
max_sma = df_copy_ps['sma_7d_return'].max()
# Avoid division by zero in scaling
if max_sma - min_sma == 0:
df_copy_ps['scaled_strength_index'] = 0
else:
df_copy_ps['scaled_strength_index'] = ((df_copy_ps['sma_7d_return'] - min_sma) / (max_sma - min_sma)) * 100
df_copy_ps[['symbol', 'date', 'scaled_strength_index']].dropna()
# Group by 'date' and calculate the mean of 'scaled_strength_index' for each day
daily_mean_strength_index = df_copy_ps.groupby('date')['scaled_strength_index'].mean().reset_index()
print ('Today´s Stock Price Strength Fear and Greed index is: ', daily_mean_strength_index.iloc[-1,-1])
After running this code, we should see this result below.
date scaled_strength_index
0 2024-08-07 42.589840
1 2024-08-08 42.589840
2 2024-08-09 42.589840
3 2024-08-12 42.589840
4 2024-08-13 42.589840
... ... ...
58 2024-10-29 35.441381
59 2024-10-30 33.525886
60 2024-10-31 32.744771
61 2024-11-01 29.398301
62 2024-11-04 30.021671
Stock Price Strength
The Volatility Index measures the degree of variation in stock prices over a specified period, often using the standard deviation of daily returns to quantify market uncertainty. It’s calculated by analyzing recent price fluctuations, typically over the past 30 days. In the context of the Fear and Greed Index, high volatility reflects “fear” as investors react unpredictably, while low volatility suggests “greed,” indicating a stable or confident market sentiment.
# Create a copy of the original DataFrame
df_copy_vol = df_history_idx30.copy()
# Calculate the daily percentage change in 'close' price for each symbol
df_copy_vol['daily_return'] = df_copy_vol.groupby('symbol')['close'].pct_change()
# Calculate the 7-day rolling standard deviation (volatility) of the daily returns for each symbol
df_copy_vol['volatility_7d'] = df_copy_vol.groupby('symbol')['daily_return'].transform(lambda x: x.rolling(window=7).std())
# Handle zero division by replacing any infinite or NaN values in 'volatility_7d' with zeros
df_copy_vol['volatility_7d'].replace([np.inf, -np.inf], np.nan, inplace=True)
df_copy_vol['volatility_7d'].fillna(0, inplace=True)
# Scale the volatility on a 0 to 100 scale for the fear and greed index context
min_vol = df_copy_vol['volatility_7d'].min()
max_vol = df_copy_vol['volatility_7d'].max()
# Avoid division by zero in scaling
if max_vol - min_vol == 0:
df_copy_vol['scaled_volatility_index'] = 0
else:
df_copy_vol['scaled_volatility_index'] = ((df_copy_vol['volatility_7d'] - min_vol) / (max_vol - min_vol)) * 100
# Handle boundary values
def transform_extremes_to_mean(df):
mean_value = df[(df['scaled_volatility_index'] != 0) & (df['scaled_volatility_index'] != 100)]['scaled_volatility_index'].mean()
df['scaled_volatility_index'] = df['scaled_volatility_index'].replace([0, 100], mean_value)
return df
df_copy_vol_transformed = transform_extremes_to_mean(df_copy_vol)
print(df_copy_vol_transformed)
print('Today´s Volatility Fear and Greed index is: ', df_copy_vol_transformed.iloc[-1,-1])
Upon running the code above, we will see this result.
date scaled_volatility_index
0 2024-08-07 15.320666
1 2024-08-08 15.320666
2 2024-08-09 15.320666
3 2024-08-12 15.320666
4 2024-08-13 15.320666
.. ... ...
58 2024-10-29 14.162872
59 2024-10-30 13.904232
60 2024-10-31 14.530508
61 2024-11-01 16.206552
62 2024-11-04 16.469431
Volume Breadth
The Volume Breadth Index assesses the strength of market participation by comparing the trading volume of advancing stocks to that of declining stocks. It’s calculated by taking the difference between the volume of advancing and declining stocks, often expressed as a ratio or percentage. In the context of the Fear and Greed Index, high volume breadth reflects “greed,” showing broad investor confidence, while low or negative volume breadth indicates “fear,” as fewer stocks experience strong trading activity.
# Create a copy of the dataframe result generated from Sectors Financial API
df_vb_copy = df_history_idx30.copy()
# Identify advancing and declining stocks
# Calculate whether the stock is advancing or declining based on the 'close' price
df_vb_copy['advancing'] = df_vb_copy.groupby('symbol')['close'].diff() > 0
df_vb_copy['declining'] = df_vb_copy.groupby('symbol')['close'].diff() < 0
# Calculate total volume of advancing and declining stocks for each date
# Advancing volume
df_vb_copy['advancing_volume'] = df_vb_copy['volume'] * df_vb_copy['advancing']
# Declining volume
df_vb_copy['declining_volume'] = df_vb_copy['volume'] * df_vb_copy['declining']
# Group by date to get total advancing and declining volumes for each day
daily_volume = df_vb_copy.groupby('date').agg({
'advancing_volume': 'sum',
'declining_volume': 'sum'
}).reset_index()
# Calculate Volume Breadth
# Avoid division by zero by handling cases where declining volume is zero
daily_volume['volume_breadth'] = daily_volume.apply(
lambda row: row['advancing_volume'] / row['declining_volume'] if row['declining_volume'] != 0 else np.nan,axis=1
)
# Apply the 7-day SMA to the Volume Breadth to smooth the values
daily_volume['sma_7d_vb'] = daily_volume['volume_breadth'].rolling(window=7).mean()
# Handle NaN or infinite values in the SMA with zeros (for stability)
daily_volume['sma_7d_vb'].replace([np.inf, -np.inf], np.nan, inplace=True)
daily_volume['sma_7d_vb'].fillna(0, inplace=True)
# Scale the Volume Breadth for the fear and greed index context
# Min and max for scaling
min_vb = daily_volume['sma_7d_vb'].min()
max_vb = daily_volume['sma_7d_vb'].max()
# Scale between 0 and 100, with protection against division by zero
if max_vb - min_vb == 0:
daily_volume['scaled_vb'] = 0
else:
daily_volume['scaled_vb'] = ((daily_volume['sma_7d_vb'] - min_vb) / (max_vb - min_vb)) * 100
# Replace 0 and 100 values with the mean
mean_scaled_vb = daily_volume['scaled_vb'].mean()
daily_volume['scaled_vb'] = daily_volume['scaled_vb'].apply(lambda x: mean_scaled_vb if x == 0 or x == 100 else x)
daily_volume[['date', 'scaled_vb']]
After running this code, we will see this result below.
date scaled_vb
0 2024-08-07 27.120935
1 2024-08-08 27.120935
2 2024-08-09 27.120935
3 2024-08-12 27.120935
4 2024-08-13 27.120935
... ... ...
58 2024-10-29 17.862489
59 2024-10-30 15.629806
60 2024-10-31 8.794240
61 2024-11-01 5.512156
62 2024-11-04 5.372818
Different from visualizing with only the matplotlib library, here we used seaborn to create a clean, grid-style background with sns.set(style=”whitegrid”), which enhances readability and presentation. This approach helps to visually separate the “Fear” and “Greed” zones with shaded areas, making the sentiment shifts in the Volume Breadth Index easier to interpret.
sns.set(style="whitegrid")
fig, ax = plt.subplots(figsize=(12, 6))
# Plot the daily mean volume breadth
ax.plot(daily_volume['date'], daily_volume['scaled_vb'], color='blue', linewidth=2, label='Mean Volume Breadth Index')
# Highlight the "Fear" and "Greed" zones
ax.fill_between(daily_volume['date'], 0, 50, color='red', alpha=0.2, label='Fear Zone')
ax.fill_between(daily_volume['date'], 50, 100, color='green', alpha=0.2, label='Greed Zone')
# Show dates on a weekly basis
ax.xaxis.set_major_formatter(mdates.DateFormatter('%b %d'))
ax.xaxis.set_major_locator(mdates.WeekdayLocator(interval=1)) # Show weekly dates
plt.xticks(rotation=45)
plt.title("Daily Mean Volume Breadth Index - Fear and Greed Context", fontsize=16, fontweight='bold')
plt.xlabel("Date", fontsize=12)
plt.ylabel("Scaled Volume Breadth Index (0-100)", fontsize=12)
plt.grid(visible=True, linestyle='--', alpha=0.6)
plt.legend(loc='upper left')
plt.tight_layout()
plt.show()
Safe Heaven Demand
The Safe Haven Demand Index measures investors’ preference for safer assets, like gold or government bonds, over stocks, indicating risk aversion. It’s calculated by tracking the price changes or trading volumes of these safe-haven assets compared to the broader stock market. In the context of the Fear and Greed Index, high demand for safe havens reflects “fear,” as investors shift away from stocks, while low demand suggests “greed,” indicating a stronger appetite for risk and confidence in the market.
# Create a copy of API generated dataframe
df_idx_shd = df_history_idx30.copy()
# Calculate average daily stock return
average_daily_return = df_idx_shd.groupby('date')['stock_return'].mean().reset_index()
average_daily_return.rename(columns={'stock_return': 'average_stock_return'}, inplace=True)
# import and load government bonds data
gov_bond_df = pd.read_csv('/content/temp_bonds_rate.csv')
# Ensure the dates columne are date format
average_daily_return['date'] = pd.to_datetime(average_daily_return['date'])
gov_bond_df['date'] = pd.to_datetime(gov_bond_df['date'])
# Perform inner join based on common dates
merged_data = pd.merge(average_daily_return, gov_bond_df, on='date', how='inner')
# Store the columns we need for calculation in one dataframe
merged_data
# Calculate the 7-day SMA for average_stock_return and rate
sma_period = 7
merged_data['sma_stock'] = merged_data['average_stock_return'].rolling(window=sma_period, min_periods=1).mean()
merged_data['sma_rate'] = merged_data['rate'].rolling(window=sma_period, min_periods=1).mean()
# Compute percentage distance from SMA for both stock returns and rates
merged_data['safe_haven_index'] = (
(merged_data['average_stock_return'] - merged_data['sma_stock']) / (merged_data['sma_stock'] + epsilon) -
(merged_data['rate'] - merged_data['sma_rate']) / (merged_data['sma_rate'] + epsilon)
) * 100
# Scale the Safe Haven Demand Index to 0-100 for Fear and Greed context
min_val, max_val = -5, 5 # Assume min/max range for scaling to 0-100
def scale_to_100(value, min_val, max_val):
scaled_value = (value - min_val) / (max_val - min_val) * 100
return np.clip(scaled_value, 0, 100)
# Replace any out-of-bounds values with the mean of the scaled index
mean_value = merged_data['safe_haven_scaled'].mean()
merged_data['safe_haven_scaled'] = merged_data['safe_haven_scaled'].apply(lambda x: mean_value if x <= 0 or x >= 100 else x)
merged_data[['date', 'average_stock_return', 'rate', 'safe_haven_scaled']]
The code outputs a dataframe that contains the average stock return, government bond rate, and the scaled Safe Haven Demand Index.
date average_stock_return rate safe_haven_scaled
0 2024-08-07 0.067625 6.806 50.000000
1 2024-08-08 -0.048310 6.775 54.205938
2 2024-08-09 0.587302 6.767 54.205938
3 2024-08-12 1.078921 6.800 54.205938
4 2024-08-13 0.656614 6.778 54.205938
... ... ... ... ...
58 2024-10-29 -0.576492 6.856 54.205938
59 2024-10-30 -0.527557 6.829 64.974123
60 2024-10-31 -0.105976 6.777 54.205938
61 2024-11-01 -1.487444 6.774 54.205938
62 2024-11-04 -0.582527 6.767 54.205938
Exchange Rate and Interest Rate
The Exchange Rate measures the relative strength of the Indonesian Rupiah (IDR) against foreign currencies, particularly the US Dollar (USD), providing insights into economic stability and investor confidence. It’s calculated by tracking daily fluctuations in the IDR-USD exchange rate. In the context of the Fear and Greed Index, a weakening IDR indicates “fear,” as investors may perceive increased economic risk, while a strengthening IDR suggests “greed,” reflecting economic stability and greater confidence in the market.
The Interest Rate reflects the central bank’s monetary policy stance and its influence on borrowing costs, economic growth, and inflation. It’s measured by tracking the daily policy rate set by Bank Indonesia. In the Fear and Greed Index, higher interest rates signal “fear,” as increased borrowing costs can dampen economic growth and market optimism. Conversely, lower interest rates reflect “greed,” encouraging investment and boosting confidence in the stock market.
To engineer the Fear and Greed Index for both Exchange Rate and Interest Rate, we analyze their trends over a fixed period by examining percentage changes that signal shifts in investor sentiment. For the exchange rate, we calculate the percentage change in the IDR-USD rate over time, with a weakening IDR suggesting “fear” and a strengthening IDR indicating “greed.” For the interest rate, which is typically issued monthly, we transform it into daily data by assigning the same rate for each day within the month. A rise in interest rates reflects “fear,” as higher borrowing costs can impact economic growth, while a decrease signals “greed,” encouraging investment. Both indicators are then scaled to a standardized 0-100 range for integration into the overall Fear and Greed Index, contributing to a comprehensive measure of investor sentiment.
Here is an example using exchange rate.
# load exchange rate csv file
er_df = pd.read_csv('/content/exchange_rate.csv')
# Apply SMA method
er_df['SMA_7'] = er_df['rate'].rolling(window=7).mean()
# Handle missing value
er_df['SMA_7'].fillna(er_df['SMA_7'].mean(), inplace=True)
# Scale the value from 0 to 100
from sklearn.preprocessing import MinMaxScaler
scaler = MinMaxScaler(feature_range=(0, 100))
er_df['scaled_fear_greed'] = scaler.fit_transform(er_df[['SMA_7']])
#Check today's exchange rate fear and greed index
print('Today´s Fear and Greed Index for Exchange Rate is: ', er_df.iloc[-1,-1])
Buffett Indicator
To engineer the Fear and Greed Index for the Buffett Indicator, we calculate the Buffett Indicator as the ratio of the stock market’s total market capitalization to Indonesia’s GDP, a commonly used measure to assess market valuation relative to the overall economy. A high Buffett Indicator suggests that the market is overvalued compared to economic output, signaling “fear” due to increased caution and risk of a potential correction. Conversely, a low Buffett Indicator implies the market is undervalued, indicating “greed” as stocks appear favorable relative to economic fundamentals. Since GDP is typically reported quarterly, we convert it to a daily measure by carrying forward the latest GDP figure until the next release, using Indonesia’s projected 2024 GDP of $1.49 trillion as a baseline. The Buffett Indicator ratio is then normalized to a 0-100 scale to align with the broader Fear and Greed Index, ensuring it accurately contributes to the overall sentiment measure.
# Create a copy dataframe from Sectors Financial API result
market_cap_df = df_history_idx30.copy()
# Group by 'date' and sum the 'market_cap' for each date
market_cap_df = market_cap_df.groupby('date')['market_cap'].sum().reset_index()
# Indonesian GDP for 2024 in Indonesian Rupiah (IDR)
# Using an approximate exchange rate: 1 USD = 15,000 IDR
usd_to_idr_exchange_rate = 15000
indonesia_gdp_idr = 1.47 * 10**12 * usd_to_idr_exchange_rate # 1.47 trillion USD to IDR
# Calculate Buffett Indicator
market_cap_df['buffett_indicator'] = (market_cap_df['market_cap'] / indonesia_gdp_idr) * 100
from sklearn.preprocessing import MinMaxScaler
# Calculate the 7-day Simple Moving Average (SMA) of the Buffett Indicator
market_cap_df['buffett_SMA_7'] = market_cap_df['buffett_indicator'].rolling(window=7).mean()
# Fill any missing values (NaNs) in the SMA with the mean of the SMA column
market_cap_df['buffett_SMA_7'].fillna(market_cap_df['buffett_SMA_7'].mean(), inplace=True)
# Scale the Buffett Indicator to a 0-100 range using MinMaxScaler
scaler = MinMaxScaler(feature_range=(0, 100))
# Scale by fitting and transforming on reshaped data
market_cap_df['buffett_scaled'] = scaler.fit_transform(market_cap_df[['buffett_SMA_7']])
# Apply adjustments to keep values within 0-100 range and handle edge cases
mean_scaled_value = market_cap_df['buffett_scaled'].mean()
market_cap_df['buffett_scaled'] = market_cap_df['buffett_scaled'].apply(
lambda x: mean_scaled_value if x <= 0 or x >= 100 else x
)
market_cap_df[['date', 'buffett_indicator_scaled']]
The result buffett indicator fear and greed index will look like this below.
date market_cap buffett_indicator buffett_SMA_7 buffett_scaled
0 2024-08-08 4613845450489856 20.924469 21.990788 77.802874
1 2024-08-09 4630499328786432 20.999997 21.990788 77.802874
2 2024-08-12 4649857907163136 21.087791 21.990788 77.802874
3 2024-08-13 4683195852259328 21.238983 21.990788 77.802874
4 2024-08-14 4709947255291904 21.360305 21.990788 77.802874
... ... ... ... ... ...
59 2024-10-31 4736517364776960 21.480804 21.893906 74.657994
60 2024-11-01 4690439897612288 21.271836 21.744455 69.806641
61 2024-11-04 4724848383229952 21.427884 21.644901 66.575003
62 2024-11-05 4741614034485248 21.503919 21.547028 63.397911
63 2024-11-06 1836315811250176 8.327963 19.593988 77.802874
Modelling for Final Indonesian Fear and Greed Index
Now that we have calculated all eight individual indices for the Fear and Greed Index, we have consolidated these results into a single dataframe, allowing us to model the final combined index. Our goal is to determine the weight or contribution of each index to the overall sentiment score, capturing both explicit contributions and any latent patterns among the indices. By identifying these weights, we aim to quantify how each index—such as market momentum, exchange rate, or safe haven demand—uniquely influences the Indonesian stock market’s sentiment landscape. This modeling step is critical to ensure that the final Fear and Greed Index accurately reflects the combined impact of diverse economic and market indicators.
Given the absence of a pre-defined “ground truth” for an overall fear and greed score, we employ unsupervised learning to explore and derive these patterns without labeled data. Unsupervised methods like PCA, ICA, and Factor Analysis are well-suited for this problem, as they allow us to capture the underlying structure and relationships among the indices. PCA helps to maximize variance, revealing the most significant contributors and reducing dimensionality without losing essential information. ICA, in contrast, emphasizes statistical independence, which can be valuable if each index captures distinct sentiment drivers. Factor Analysis, meanwhile, identifies latent factors, assuming correlations among indices that reflect broader, shared sentiment influences. By applying these methods, we can construct a well-rounded and empirically-based Fear and Greed Index, balancing variance, independence, and latent factors.
Principle Component Analysis
from sklearn.preprocessing import StandardScaler
# Standardize the data to have mean 0 and standard deviation 1
scaler = StandardScaler()
numeric_df_scaled = scaler.fit_transform(numeric_df)
from sklearn.decomposition import PCA
# Initialize PCA and fit to the scaled data
pca = PCA(n_components=8) # We are keeping all indices initially
pca.fit(numeric_df_scaled)
# Get the explained variance ratio for each principal component
explained_variance = pca.explained_variance_ratio_
# Display the explained variance
for i, variance in enumerate(explained_variance, 1):
print(f"Principal Component {i}: {variance:.2%} of the variance explained")
We will see below how each component contributes to the variance.
Principal Component 1: 33.96% of the variance explained
Principal Component 2: 26.98% of the variance explained
Principal Component 3: 12.95% of the variance explained
Principal Component 4: 10.56% of the variance explained
Principal Component 5: 6.77% of the variance explained
Principal Component 6: 4.34% of the variance explained
Principal Component 7: 3.14% of the variance explained
Principal Component 8: 1.30% of the variance explained
Next, we will get the loadings and calculate the absolute contributions and normalize them to get percentage.
# Get the loadings (components) and focus on the first component
loadings = pca.components_[0] # First principal component
variable_names = numeric_df.columns
# Calculate the absolute contributions and normalize them to get percentages
loadings_percent = 100 * abs(loadings) / abs(loadings).sum()
# Display the contribution of each variable
contributions = pd.DataFrame({'Variable': variable_names, 'Contribution (%)': loadings_percent})
contributions.sort_values(by='Contribution (%)', ascending=False, inplace=True)
PCA suggests that the price strength, market momentum, and volume breadth indices are the primary contributors, collectively explaining over 70% of the variance. This indicates that market performance indicators are the most significant drivers of sentiment in the Fear and Greed Index. Exchange rate and buffett indicator contribute moderately, with the smallest contributions from interest rate, safe haven demand, and volatility. This distribution implies that macroeconomic factors like interest rates and volatility are less central to market sentiment under this method, making PCA suitable if the goal is to capture variance driven primarily by direct market indicators.
Variable Contribution (%)
2 price_strength 24.686142
0 market_momentum 23.786931
4 volumn_breadth 23.425010
6 exchange_rate 12.510847
3 buffett_indicator 8.480610
7 interest_rate 3.424652
5 safe_heaven 2.084997
1 volatility 1.600812
Independent Component Analysis
from sklearn.decomposition import FastICA
from sklearn.preprocessing import StandardScaler
import pandas as pd
# Standardize the data to mean 0 and standard deviation 1
scaler = StandardScaler()
numeric_df_scaled = scaler.fit_transform(numeric_df)
# Apply ICA
ica = FastICA(n_components=8, random_state=42)
ica_components = ica.fit_transform(numeric_df_scaled) # Transformed independent components
# Get the mixing matrix, which contains the contributions of each index to each independent component
ica_mixing = ica.mixing_
# Extract and normalize the first component to get the contribution percentage for each index
ica_contributions = 100 * abs(ica_mixing[:, 0]) / abs(ica_mixing[:, 0]).sum()
ica_contributions_df = pd.DataFrame({'Variable': variable_names, 'ICA Contribution (%)': ica_contributions})
# Sort and display the contributions
ica_contributions_df.sort_values(by='ICA Contribution (%)', ascending=False, inplace=True)
print("Independent Component Analysis (ICA) Contributions:n", ica_contributions_df)
ICA highlights price strength (35%) and market momentum (29%) as the most independent drivers of sentiment, with buffett indicator and volume breadth also playing moderate roles. However, exchange rate, volatility, and interest rate contribute minimally, showing that they provide little unique, independent information for sentiment. ICA’s emphasis on price strength and market momentum suggests that these indices capture distinct, non-overlapping signals that directly impact investor sentiment. ICA is advantageous if we prioritize unique, independent indicators of fear and greed, as it filters out redundancy.
Independent Component Analysis (ICA) Contributions:
Variable ICA Contribution (%)
2 price_strength 35.217306
0 market_momentum 29.382728
3 buffett_indicator 16.666646
4 volumn_breadth 11.988487
6 exchange_rate 4.112347
1 volatility 1.760180
5 safe_heaven 0.723589
7 interest_rate 0.148717
*Factor Analysis
# Apply Factor Analysis
factor_analyzer = FactorAnalysis(n_components=2, random_state=42) # Assume 2 latent factors for simplicity
factor_analyzer.fit(numeric_df_scaled)
# Extract factor loadings
factor_loadings = factor_analyzer.components_
# Create a DataFrame of factor loadings
factor_loadings_df = pd.DataFrame(factor_loadings.T, index=variable_names, columns=['Factor1', 'Factor2'])
factor_loadings_df
Factor1 Factor2 Factor1 Contribution (%)
market_momentum -0.885500 -0.006721 26.152412
volatility 0.078254 0.734272 2.311153
price_strength -0.925022 0.158749 27.319637
buffett_indicator -0.197961 0.579523 5.846599
volumn_breadth -0.747953 0.075496 22.090072
safe_heaven 0.045285 -0.182631 1.337452
exchange_rate -0.441639 -0.755679 13.043400
interest_rate -0.064308 0.359978 1.899275
# Calculate contribution percentages for the first factor to interpret as potential weights
factor1_contributions = 100 * abs(factor_loadings_df['Factor1']) / abs(factor_loadings_df['Factor1']).sum()
factor_loadings_df['Factor1 Contribution (%)'] = factor1_contributions
# Display factor loadings and contributions
print("Factor Analysis Loadings:n", factor_loadings_df)
Factor Analysis provides a balanced view, revealing that price strength, market momentum, and volume breadth are also the dominant components in the primary latent factor (Factor 1). However, exchange rate and buffett indicator contribute more here than in PCA or ICA, suggesting that these indices are part of a correlated sentiment factor that includes macroeconomic stability indicators. Volatility also emerges as significant in a secondary factor, indicating a separate dimension of risk sentiment. Factor Analysis thus implies that sentiment is multifaceted, with broader market indicators driving one aspect and macroeconomic factors influencing another, more risk-oriented dimension.
Factor Analysis Loadings:
Factor1 Factor2 Factor1 Contribution (%)
market_momentum -0.885500 -0.006721 26.152412
volatility 0.078254 0.734272 2.311153
price_strength -0.925022 0.158749 27.319637
buffett_indicator -0.197961 0.579523 5.846599
volumn_breadth -0.747953 0.075496 22.090072
safe_heaven 0.045285 -0.182631 1.337452
exchange_rate -0.441639 -0.755679 13.043400
interest_rate -0.064308 0.359978 1.899275
Final Decision
In conclusion, after comparing the results of the PCA, ICA, and Factor Analysis models, the final decision is to use the mean of the three sets of weights as the basis for constructing the overall Fear and Greed Index for the Indonesian stock market. This approach balances the distinct insights each model offers, combining variance maximization, independence detection, and latent factor representation to create a more comprehensive sentiment index.
Each model provides unique advantages. PCA highlights the indices that explain the most variance in the data, underscoring the influence of indices like market momentum and exchange rate on the overall sentiment. This approach captures the dominant factors but may overlook unique contributions from less correlated indices. ICA, on the other hand, identifies independent sources of sentiment, emphasizing the influence of volume breadth and exchange rate as distinct drivers of market sentiment. While ICA offers valuable insights into independent sentiment components, it may understate the influence of interrelated factors that affect multiple indices simultaneously. Factor Analysis provides a middle ground by identifying latent factors shared among indices, showing that market momentum, price strength, and volume breadth align closely with a common sentiment factor, while volatility and interest rate represent a separate dimension of market risk.
By averaging the weights derived from these three methods, we benefit from the comprehensive view provided by PCA, the independence-focused insights of ICA, and the factor correlation perspectives of Factor Analysis. This blended approach creates a balanced and robust Fear and Greed Index that is sensitive to dominant, independent, and correlated drivers of market sentiment, ensuring that the final model is both versatile and representative of diverse market influences. Using the mean weights allows us to incorporate the strengths of each model, resulting in a more resilient and nuanced measure of sentiment for the Indonesian stock market.
Variable PCA Contribution (%) ICA Contribution (%)
0 price_strength 24.686142 35.217306
1 market_momentum 23.786931 29.382728
2 volumn_breadth 23.425010 11.988487
3 exchange_rate 12.510847 4.112347
4 buffett_indicator 8.480610 16.666646
5 interest_rate 3.424652 0.148717
6 safe_heaven 2.084997 0.723589
7 volatility 1.600812 1.760180
Factor1 Contribution (%) Weighted Average (%)
0 27.319637 29.074362
1 26.152412 26.440690
2 22.090072 19.167856
3 13.043400 9.888865
4 5.846599 10.331285
5 1.899275 1.824215
6 1.337452 1.382013
7 2.311153 1.890715