ImageVerifierCode 换一换
格式:PPT , 页数:1 ,大小:977KB ,
资源ID:389497      下载积分:2000 积分
快捷下载
登录下载
邮箱/手机:
温馨提示:
如需开发票,请勿充值!快捷下载时,用户名和密码都是您填写的邮箱或者手机号,方便查询和重复下载(系统自动生成)。
如填写123,账号就是123,密码也是123。
特别说明:
请自助下载,系统不会自动发送文件的哦; 如果您已付费,想二次下载,请登录后访问:我的下载记录
支付方式: 支付宝扫码支付 微信扫码支付   
注意:如需开发票,请勿充值!
验证码:   换一换

加入VIP,免费下载
 

温馨提示:由于个人手机设置不同,如果发现不能下载,请复制以下地址【http://www.mydoc123.com/d-389497.html】到电脑端继续下载(重复下载不扣费)。

已注册用户请登录:
账号:
密码:
验证码:   换一换
  忘记密码?
三方登录: 微信登录  

下载须知

1: 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。
2: 试题试卷类文档,如果标题没有明确说明有答案则都视为没有答案,请知晓。
3: 文件的所有权益归上传用户所有。
4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
5. 本站仅提供交流平台,并不能对任何下载内容负责。
6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

版权提示 | 免责声明

本文(Supported by the NSF Division of Materials Research.ppt)为本站会员(diecharacter305)主动上传,麦多课文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知麦多课文库(发送邮件至master@mydoc123.com或直接QQ联系客服),我们立即给予删除!

Supported by the NSF Division of Materials Research.ppt

1、,Supported by the NSF Division of Materials Research,The Materials Computation Center Duane D. Johnson and Richard M. Martin (PIs) Funded by NSF DMR 03-25939,D.D. Johnson, D.E. Goldberg, and P. Bellon Students: Kumara Sastry (MSE/GE), Jia Ye (MSE) Departments of Materials Science and Engineering and

2、 General Engineering University of Illinois at Urbana-Champaign,Multiscale Modeling Methods for Materials Science and Quantum Chemistry,Multiscaling via Symbolic Regression,Genetic Programming: Machine-Learning Method for Multiscale Modeling,Ab Initio Accurate Semiempirical Quantum Chemistry Potenti

3、als via Multi-Objective GAs,First, what is a Genetic Programming (GP)? A Genetic Program is a genetic algorithm that evolves computer programs, requiring: Representation: programs represented by trees Internal nodes contain functionse.g., +, -, *, /, , log, exp, sin, AND, if-then-else, for Leaf node

4、s contain terminalse.g., Problem variables, constants, Random numbers Fitness function: Quality measure of the program Population: Candidate programs (set of individuals) Genetic operators: Selection: “Survival of the fittest”. Recombination: Combine parents to create offspring. Mutation: Small rand

5、om modification of offspring.,Goal: Evolve constitutive “law” between macroscopic variables from stress-strain data with multiple strain-rates for use in continuum finite-element modeling. Flow stress vs. temperature-compensated strain rate for AA7055 Aluminum Padilla, et al. (2004).GP fits both low

6、- and high strain-rate data well by introducing (effectively) a step-function between different strain-rate even though no knowledge of two sets of strain-rate data were indicated to GP. Automatically identified transition point via a complex relation, g, which models a step function between strain-

7、rates involved.GP identifies “law” with two competing mechanisms5-power law modeling known creep mechanism4-power law for as-yet-unknown creep mechanism.,1. Evolving Constitutive Relations,2. Multi-Timescale Kinetics Modeling Goal: To advance dynamics simulation to experimentally relevant time scale

8、s (seconds) by regressing the diffusion barriers on the PES as an in-line function.Molecular Dynamic (MD) or Kinetic Monte Carlo (KMC) based methods fall short 39 orders of magnitude in real time.Unless ALL the diffusion barriers are known in a look-up table.Table KMC has109 increase in “simulated t

9、ime” over MD at 300K.Our new “Symbolically-Regressed” KMC (sr-KMC) Use MD to get some barriers. Machine learn via GP all barriers as a regressed in-line function call, i.e. “table-look-up” KMC is replaced by function.,Summary Our results indicate that GP-based symbolic regression is an effective and

10、 promising tool for multiscaling. The flexibility of GP makes it readily amenable to hybridization with other multiscaling methods leading to enhanced scalability and applicability to more complex problems. Unlike traditional regression, GP adaptively evolves both the functional relation and regress

11、ion constants for transferring key information from finer to coarser scales, and is inherently parallel.,GP predicts all barriers with 0.1% error from explicit calculations of only 3% of the barriers. (Standard basis-set regressions fail.)GP symbolic-regression approach yields:102 decrease in CPU ti

12、me for barrier calculations.102 decrease in CPU over table-look-ups (in-line function call).104107 less CPU time per time-step vs. on-the-fly methods (note that each barrier calculation requires 10 s with empirical potential, 1800 s for tight-binding, and first-principles even more).(Future) Could c

13、ombine with pattern-recognition methods (e.g., T. Rahman et al.), or temperature-accelerated MD, to model more complex cooperative dynamics.(Current) Utilize the GP in-line table function obtain from tight-binding potential in a kinetic Monte Carlo simulation for this surface alloy vacancy-assisted

14、diffusion.,D.D. Johnson, T.J. Martinez, and D.E. Goldberg, Students: Kumara Sastry (MSE/GE) and Alexis L. Thompson (Chemistry) Departments of Materials Science and Engineering, Chemistry, and General Engineering University of Illinois at Urbana-Champaign,Goal: Functional augmentation and rapid multi

15、-objective reparameterization of semi-empirical methods to obtain reliable pathways for excited-state reaction chemistry.Ab Initio methods: accurate, but highly expensive.Semi-Empirical (SE) methods: approximate, but very inexpensive.Reparameterization based on few ab initio calculated data sets inv

16、olving excitations of a molecule, rather than low-energy (Born-Oppenheimer) states, e.g. use MNDO-PM3 Hamiltonian and find the MNDO parameters specific to particular molecular system.Involves optimization of multiple objectives, such as fitting simultaneously limited ab initio energy and energy-grad

17、ients of various chemical excited-states or conformations.(Future) Augmentation of functions may be needed.Propose: Multi-objective GAs for reparameterizationNon-dominate solutions represent physically allowed solutions, whereas dominant solutions can lead to unphysical solutions.Obtain set of Paret

18、o non-dominate solutions in parallel, not serially.Avoid potentially irrelevant pathways, arising from SE-forms, so as to reproduce more accurate reaction paths.(Future) Use Genetic Programming for functional augmentation, e.g., symbolic regression of core-core repulsions.Advantages of GA/GP Multi-O

19、bjective Optimizations, method is:robust, and yields good quality solutions quickly, reliably, and accurately, converges rapidly to Pareto-optimal ones, maintain diverse populations, suited to finding diverse solutions, niche-preserving methods may be employed, implicitly parallel search method, unl

20、ike applications of classic methods.,Kumara Sastry, D.D. Johnson, D.E. Goldberg, and P. Bellon, Int. J. of MultiScale Computational Engineering 2 (2), 239-256 (2004).,Analytic Estimate of Population Size vs. Empirical Results: Population size (no. of solutions kept to evolve) is a critical factor to

21、 ensure reliable solution. Shown is the probability that at least one copy of all raw subcomponents appear in population vs population size, n, for different tree sizes =2h, for the later diffusion example. Finding: population of 150-200 is enough.,Getting the Problems Measure of Fitness Problem-dep

22、endent choice: e.g., for diffusion, choose weighted (wi) least-squares fit of GP-derived vs. M calculated barriers, where wi = |EMD|1 as lower-energy barriers are more accessible than high-energy ones. Fit could to experimental data, too.,Getting the Problems Optimal Population Size,Application: Sur

23、face-vacancy-assisted diffusion in segregating CuxCo1-x.Using Molecular Dynamics based on density-functional, tight-binding, or empirical potentials, we calculate M (un)relaxed saddle-point energies E(xi) for atoms surrounding a vacancy with first and second neighbor environment denoted by 0 or 1 (f

24、or binary alloys) in a vector xi.GP evolves in-line barrier function and predicts remaining unknown barriers.Newly predicted low-energy barriers are calculated directly by MD as verification step. If correct, use barrier function. If not correct, now have new barrier in a M+1 learning set. Repeat cy

25、cle (MD is +99.9% of step).,K. Sastry, H.A. Abbass, D.E. Goldberg, D.D. Johnson, “Sub-structural Niching in Estimation Distribution Algorithms,“ Genetic and Evolutionary Computation Conference (2005).Kumara Sastry, D.D. Johnson, D.E. Goldberg, and P. Bellon, “Genetic programming for multitimescale m

26、odeling,“ Phys. Rev. B 72, 085438-9 (2005). *chosen by the AIP Editors as focused article of frontier research in Virtual Journal of Nanoscale Science & Technology, Vol 12, Issue 9 (2005).,Getting the Problems Basis Functions Using these operations a tree-like code is self-generated and provides mac

27、hine-learned “basis functions” and their “coefficients” (by fitting to some measure of fitness, e.g., comparing calculated and GP-derived diffusion barriers). Example “leaf of the tree” (term in basis) created via the above “genetic operators”, where (a) and (b) leaves created (e) and (f).,What is N

28、on-Dominant Solutions on Pareto-Optimal Front? Using a MNDO method for Benzene C6H6 requires 11 parameters, if the H parameters are fixed. To fit accurately CASPT2 results for two objectives (energy and energy-gradient errors) on the excited-state potential energy surface (Frank-Condon region), the

29、11 parameters are globally optimized keeping a population of solutions to evolve and the solutions at the nose of the Pareto are accepted as best solutions.,Semi-empirical potential parameterizations lead to differing solutions, or competing solutions. Using GA/GP we can find optimal potentials and

30、avoid pathways from dominating but irrelevant solutions.e.g., solution C is dominate over B in error in energy.solutions A and B are non-dominate in multiobjectives.Pareto front solutions are denoted by red circles and all represent potentially “good” solutions all with different parameter sets for

31、the empirical potentialFor physical reasons, the “nose” of Pareto front (between A and B) gives the most optimal solution desired for quantum-chemistry applications.,Biasing the Multi-Objective Search Weights can be assigned to each objective to bias search and speed up global search. For example, e

32、rror in energy can easier weighted as more important to minimize than the error in energy-gradient. , even if both objectives are obtain via an analytic formula. Such weighting is an important parameter for control of time to solution.,SummaryWe find that non-dominant, multi-objective reparameteriza

33、tion of empirical Hamiltonians using Genetic Algorithms is an effective tool for developing ab initio accurate empirical potential based upon databases from high-level quantum-chemistry methods.Excited-state properties (reaction paths and structures) are in very good agreement with direct CASPT2 cal

34、culations.We find that parameters sets from one molecular system is transferable to a similar molecular system, opening the possibility of addressing more complex molecular interactions.,Ab Initio Accurate Semiempirical Potentials Excited-State Reaction ChemistryRecently, use of genetic algorithms t

35、o fit empirical potentials has grown in interest to build in more problem specific information cheaply. For example, developing an accurate empirical potential from database of high-level quantum-chemistry results is done by serial fitting to minimize error in energy differences between ground-state

36、 and excited states and then error in the energy derivative differences. Typically, however, the fitting is done in a serial fashion (first on error of energy difference, then on error in derivatives), which is not a global search. Moreover, the genetic algorithms used are not so-called competent GA

37、s developed from optimization theory, which lead to bad scaling and inefficient performance.Here we explore the use of Non-Dominant, Multi-Objective Minimization using Genetic Algorithm to reparameterize semi-empirical quantum-chemistry potentials over a global search domain using the concepts of Pa

38、reto optimization fronts.,(Un)Biased GA Multiobjective Optimization of BenzeneBiasing (here factor of 2) the error in energy over error in energy-gradient yields rapid advance of Pareto front and physical solutions.Unbiased, if left to evolve long enough, reaches biased solutions, but early solution

39、s may yield unphysical excited-state reactions.(Un)Biased solutions on the Pareto front consistently better than all previous parameterizations, including using standard GA optimization, e.g., from Martinez and coworkers, see Toniolo, et al. (2004).,Re-parameterized MNDO Hamiltonian yields relativel

40、y accurate excited-state potential energy surfaces.GA-MO-dervied MNDO S2/S1 conical intersections agree well with CASPT2, even though only included x=0 reaction coordinate in fitting.Molecular geometry for excited-states also agree well.,(Un)Biased GA Multiobjective Optimization of Ethylene, C2H4.Fo

41、und similar results to Benzene: Biased solutions on Pareto front often better than unbiased and always physical. But near the nose all solutions are physical.We find that the historical MNDO parameters are a set yielding almost unphysical solutions (see figure near 2.5 eV on error in energy).GA-MO-d

42、erived MNDO S2/S1 conical intersections agree well with CASPT2, with only x=0 reaction coordinate included in fitting.Molecular geometry for excited-states also agree well.,Transferability of the MNDO parameters: Amazingly we find that a Benzene set of parameters may be used for Ethylene and provide

43、 a solution near a Pareto set found by direct optimization.,Population Analysis for Ethylene, C2H4Must maintain large enough population to obtain full Pareto front but not so large as to waste computational resources because each solution is a full MNDO run for the set of molecular configurations us

44、ed in fitting!,Red Line is Pareto front for large population 1000.Analytic estimate suggests 760 is required to find population size.Figure show that until 800 the Pareto front is not found.For Benzene, only about 150 is required for the population size.,Future DirectionsWe will investigate the use

45、of Genetic Programming to machine-learn new and more accurate empirical potential functional forms. e.g. We will start with the original MNDO Hamiltonian and machine-learn in a molecular-specific way a GP-MNDO Hamiltonian. With this GP-MNDO Hamiltonian we can perform nearly ab initio accurate global

46、 searches of reaction pathways, which later may be studied with higher-level methods for reactions of interest.,AcknowledgementsWe thank ILLIGAL (Illinois Genetic Algorithms Lab) for use of their parallel cluster for the MO-GA optimization.This multidisciplinary effort was made possible only via support of the MCC and the National Science Foundation (Divisions of Chemistry and Materials Research).,

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1