% Template file for an a0 portrait poster.
% Written by Graeme, 2001-03 based on his SOC poster.
%
% See discussion and documentation at
% <http://www.astro.gla.ac.uk/users/norman/docs/posters/> 
%
% $Id: poster-template-portrait.tex,v 1.2 2002/12/03 11:25:55 norman Exp $


% We switch to portrait mode. This works as advertised.
\documentclass[a0,portrait]{a0poster}
% You might find the 'draft' option to a0 poster useful if you have
% lots of graphics, because they can take some time to process and
% display. (\documentclass[a0,draft]{a0poster})

% Switch off page numbers on a poster, obviously, and section numbers too.
\pagestyle{empty}
\setcounter{secnumdepth}{0}

% The textpos package is necessary to position textblocks at arbitrary 
% places on the page.
\usepackage[absolute]{textpos}

% Graphics to include graphics. Times is nice on posters, but you
% might want to switch it off and go for CMR fonts.
\usepackage{graphics,wrapfig,times}

% These colours are tried and tested for titles and headers. Don't
% over use color!
\usepackage{color}
\definecolor{DarkBlue}{rgb}{0.1,0.1,0.5}
\definecolor{Red}{rgb}{0.9,0.0,0.1}

% see documentation for a0poster class for the size options here
\let\Textsize\normalsize
\def\Head#1{\noindent\hbox to \hsize{\hfil{\LARGE\color{DarkBlue} #1}}\bigskip}
\def\LHead#1{\noindent{\LARGE\color{DarkBlue} #1}\smallskip}
\def\Subhead#1{\noindent{\large\color{DarkBlue} #1}}
\def\Title#1{\noindent{\VeryHuge\color{Red} #1}}


% Set up the grid
%
% Note that [40mm,40mm] is the margin round the edge of the page --
% it is _not_ the grid size. That is always defined as 
% PAGE_WIDTH/HGRID and PAGE_HEIGHT/VGRID. In this case we use
% 15 x 25. This gives us a wide central column for text (7 grid
% spacings) and two narrow columns (3 each) at each side for 
% pictures, separated by 1 grid spacing.
%
% Note however that textblocks can be positioned fractionally as well,
% so really any convenient grid size can be used.
%
\TPGrid[40mm,40mm]{15}{25}  % 3 - 1 - 7 - 1 - 3 Columns

% Mess with these as you like
\parindent=0pt
%\parindent=1cm
\parskip=0.5\baselineskip

% abbreviations
%\newcommand{\ddd}{\,\mathrm{d}}

\begin{document}

% Understanding textblocks is the key to being able to do a poster in
% LaTeX. In
%
%    \begin{textblock}{wid}(x,y)
%    ...
%    \end{textblock}
%
% the first argument gives the block width in units of the grid
% cells specified above in \TPGrid; the second gives the (x,y)
% position on the grid, with the y axis pointing down.

% You will have to do a lot of previewing to get everything in the 
% right place.

% This gives good title positioning for a portrait poster.
% Watch out for hyphenation in titles - LaTeX will do it
% but it looks awful.
\begin{textblock}{12}(0,0)
\baselineskip=3\baselineskip \Title{Neural Network Model for
\\Thermal Conductivity of Steels}
\end{textblock}

\begin{textblock}{12}(0,1.5)
\LHead{Mathew Peet, Hala Salman Hasan \\
\texttt{mjp54@cam.ac.uk, hh313@cam.ac.uk}}
\end{textblock}

% Put the University logo in the top right.
\begin{textblock}{3}(12,0)
\resizebox{3\TPHorizModule}{!}{\includegraphics{CUnibig.pdf}}
\end{textblock}


% An example text block, to get you started!
\begin{textblock}{7}(0,2.4)
  \LHead{Introduction}
  
  Thermal conductivity is an important parameter in the heat treatment and use of steels. Temperature gradients during cooling can lead to microstructural gradients and to residual stresses in steel components. Thermal transients can influence the development of stresses, reducing service life and safety. %Thermal conductivity is also a major factor determining the mechanical properties of weldments, determining the size of the heat-affected zone and the cooling rate.

  The impetus for the development of our model was to provide thermal conductivity values for the design of a steel quenching probe. The critical dimensions of such a probe scale linearly with the thermal conductivity, so a probe made from steel has to be proportionally smaller than the commonly used standard probes made from silver or aluminium. Having a model for thermal conductivity would allow us to investigate the heat transfer coefficient of any steel, rather than being limited to only those with available data.
 
  Previous work on the thermal conductivity of steel shows that it varies widely as a function of composition. Presumably due to the complexity, there is very little fundamental research on the effect of alloying elements and temperature upon the thermal conductivity. There is a large amount of relatively low-quality data available which gives the thermal conductivity for particular grades of steel. These data are produced by steel suppliers for steel selection purposes; in reality each steel grade represents a range of compositions, and very few details of the microstructure at each temperature tested are available.


  Due to the complexity of the composition dependence and the lack of any existing physical model, it is appropriate to proceed by developing a neural network model.
  
\end{textblock}


% Another text block in the bottom left.
\begin{textblock}{7}(0,7.8)
  \LHead{Bayesian Neural Networks}

  To enable the modelling of thermal conductivity for steels of arbitrary composition, a database of 223 steels was collated, with compositions including 15 different elements.
  
  The neural network was used as a general form of regression, as previously applied to many problems in materials science~\cite{bhadeshia-review1, bhadeshia-review2}. The neural network used has been developed in a statistical framework and is able to infer automatically the appropriate complexity of the model~\cite{mackaythesis}. This helps to avoid the problem of over-fitting with the very flexible equations used in neural network models.


\resizebox{3\TPHorizModule}{!}{\includegraphics{neural9b.pdf}}


\textcolor{DarkBlue}{Structure of the three-layer neural network}

\end{textblock}


\begin{textblock}{3.4}(3.7,10.1)
The output variable, $y$, is expressed as a linear summation of activation functions, $h_i$, with weights $w_i$ and the bias $\theta$.
\begin{equation}
y = \sum_i w_i h_i + \theta
\label{hyper1}
\end{equation}

with the activation function for a neuron $i$ in the hidden layer given by,
\begin{equation}
h_i = \tanh \left(  \sum_j w_{ij} x_j + \theta_i \right)
\label{hyper2}
\end{equation}

with weights $w_{ij}$ and biases $\theta_i$. The weightings are simplified by normalising the data within the range $\pm 0.5$ using the normalisation function $x_j = \frac{x - x_{\mathrm{min}}}{x_{\mathrm{max}} - x_{\mathrm{min}}} - 0.5$, where $x$ is the value of the input and $x_j$ is the normalised value.
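
As an illustration only, using hypothetical values that are not taken from the database, an input with $x_{\mathrm{min}} = 0$, $x_{\mathrm{max}} = 1.2$ and $x = 0.3$ would be normalised to
\[
x_j = \frac{0.3 - 0}{1.2 - 0} - 0.5 = -0.25 ,
\]
so that $x = x_{\mathrm{min}}$ maps to $-0.5$ and $x = x_{\mathrm{max}}$ maps to $+0.5$.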

\end{textblock}


\begin{textblock}{7}(0,15.3)
In the Bayesian neural network~\cite{mackaythesis}, `training' is achieved by altering the parameters by back-propagation~\cite{graddescentbackprop} to optimise an objective function which combines an error term ($E_D$), to assess how good the fit is, and a regularisation term ($E_W$), to penalise large weights,

\begin{equation}
\mathrm{M}(w) = \beta E_D + \alpha E_W = \beta  \left(\frac{1}{2}\sum_i (t^{(i)}-y^{(i)})^2 \right) + \alpha \left(\frac{1}{2}\sum_i w_i^2\right) 
\end{equation}
where $\beta$ and $\alpha$ are parameters which greatly influence the complexity of the model, and $t^{(i)}$ and $y^{(i)}$ are the target and corresponding output values for one example input $x^{(i)}$ from the training data.

The Bayesian framework automatically infers over-complex and under-regularised models to be less probable, even though the flexibility of equation~\ref{hyper1} allows them to fit the data better. Assuming that the uncertainty about the output $y$ has a Gaussian distribution, the variance $\sigma^2_u$, which sets the size of the error bars, can be calculated from the Hessian $\mathrm{A}$ of $\mathrm{M}(w)$ with respect to the parameters by,

\begin{equation}
\sigma^2_u = \mathrm{g}^\mathrm{T}_{(u)}\mathrm{A}^{-1}\mathrm{g}_{(u)}
\end{equation}
where $\mathrm{g}_{(u)}$ is $\partial y/\partial \mathrm{w}$ evaluated at $\mathrm{x}^{(u)}$~\cite{mackay-data-selection}.


Other modelling procedures also help to produce a robust model, such as the use of training and testing sets, and the formation of a committee of sub-models, each converging from a different position in parameter space. After training, the models can then be assessed by testing whether the trends in the predictions are as expected and, more objectively, by the prediction of unseen data. A major advantage of the approach is that it allows the calculation of error bars which vary in size depending on the position in the input space and so indicate the confidence in the predictions.
\end{textblock}


% Another text block in the top right.
\begin{textblock}{7}(8,2.2)
  \LHead{Predictive ability}

The model generalises sufficiently to reproduce the general trends in the data and to make useful predictions for unseen compositions. Here we compare the predictions of the model against data for a ferritic steel and an austenitic stainless steel used in the nuclear industry~\cite{Leibowitz1988}. In both cases the measured values lie completely within the error bars of the model, even though the exact reported variation as a function of temperature is not matched, particularly for the ferritic steel. The difference in the prediction for the ferritic steel is similar to the experimental differences reported in various papers.

\begin{center}
\resizebox{6\TPHorizModule}{!}{\includegraphics{test-ss/D9.pdf}}


\resizebox{6\TPHorizModule}{!}{\includegraphics{test-ss/HT9.pdf}}
\end{center}
\textcolor{DarkBlue}{Predictions for Sandvik alloys D9 (Fe-15.5Ni-13.5Cr-2Mn-2Mo-0.75Si-0.25Ti-0.04C wt\%) and HT9 (Fe-0.5Ni-12Cr-0.2Mn-1Mo-0.25Si-0.5W-0.5V-0.2C wt\%)}

The general performance of the model was tested by making predictions for unseen data. These were grouped into those within the range of the data used for training and those outside that range. This grouping does not necessarily classify them as interpolation and extrapolation, because the unseen steels can contain the elements in different combinations. As can be seen from the table, the perceived error (1 standard deviation) matches the root mean squared error well for data within the range of the model, and exceeds it for data beyond that range.

\begin{center}
% No table float here: floats are not allowed inside a textblock.
\begin{tabular}{|l|c|c|}
\hline
Data set                          & Perceived error & Root mean squared error \\
\hline
Unseen data within range of model & 5.5             & 6.1                     \\
Data beyond range of model        & 82.3            & 50.8                    \\
\hline
\end{tabular}
\end{center}
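
For reference, and assuming the usual definition is the one intended, the root mean squared error over a set of $N$ unseen examples ($N$ is a symbol introduced here for illustration) can be written as
\[
\mathrm{RMSE} = \sqrt{\frac{1}{N}\sum_{i=1}^{N} \left(t^{(i)} - y^{(i)}\right)^2} ,
\]
where $t^{(i)}$ and $y^{(i)}$ are the measured and predicted values for example $i$.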

\end{textblock}


\begin{textblock}{7}(8,17.5)
\LHead{Conclusions}

A model has been developed which can predict the thermal conductivity of steels, along with a meaningful estimate of the accuracy of the predictions. The model is publicly available online~\cite{model}.

In future work it may be possible to improve the model by including calculated, physically meaningful parameters as inputs. For example, the equilibrium volume fractions of austenite, cementite and ferrite could be included in an attempt to distinguish the effects of the different phases as they vary as a function of temperature.

It seems likely that any significant improvement to the model would require new experiments to measure the effect of microstructure; such measurements would be needed to model changes in thermal conductivity as a function of time and temperature.
\end{textblock}

% Another text block in the bottom right.
\begin{textblock}{7}(8,20.6)
 \LHead{References}
\end{textblock}

\begin{textblock}{7}(8,20.6)
\renewcommand{\refname}{}
\bibliography{TC-ref}
\bibliographystyle{unsrt}

\end{textblock}


% If you want to add a figure do something like this:

%\begin{textblock}{3}(0,15)
%  \begin{center}
%\resizebox{3\TPHorizModule}{!}{\includegraphics{my_figure.eps}}
%\\Figure 5: Googles per Snark (renormalised with wild angry men
%  \end{center}
%\end{textblock}


% Place the group logo at the bottom left - visually this balances
% well with the University logo at the top right. 
\begin{textblock}{4}(0.3,22.2)
  \begin{center}
 
\resizebox{1.5\TPHorizModule}{!}{\includegraphics{holly.pdf}}
\color{Red}\\Phase Transformations and Complex Properties Group\\Department of Materials Science and
  Metallurgy\\University of Cambridge
  \end{center}
\end{textblock}

\end{document}