sqlldr 可以把文本文件导入到数据库里。
简
命令行命令:
sqlldr userid=dbUserName/dbName control=sqlldr.ctl log=sqlldr.log bad=sqlldr.bad bindsize=1048576000 rows=500000 readsize=209715200 multithreading=TRUE direct=TRUE;
sqlldr.ctl 是导入的控制文件:
load data
characterset UTF8 # 需要加载中文字符时应在控制文件里指定字符集
infile csv.txt2 # 要导入的数据文件
into table scott.caiyunlog append # 导入的目标表,导入方式是append 追加,还有其他方式见下面
(
ip terminated by '|', # 每个字段可以指定自己的分隔符
rtime terminated by '|',
method terminated by '|',
uri terminated by '|',
proto terminated by '|',
code terminated by '|',
respsize terminated by '|',
T terminated by '|',
D terminated by whitespace # 最后的那个字段没有分隔符,就用空白符,以最后的换行符为分隔符。
)
导入方式
- insert –为缺省方式,在数据装载开始时要求表为空
- append –在表中追加新记录
- replace –删除旧记录(用 delete from table 语句),替换成新装载的记录
- truncate –删除旧记录(用 truncate table 语句),替换成新装载的记录
sqlldr用法
要注意是errors参数,当错误的数据行数达到这个参数的值时会终止导入。
Usage: SQLLDR keyword=value [,keyword=value,...] Valid Keywords: userid -- ORACLE username/password control -- control file name log -- log file name bad -- bad file name data -- data file name discard -- discard file name discardmax -- number of discards to allow (Default all) skip -- number of logical records to skip (Default 0) load -- number of logical records to load (Default all) errors -- number of errors to allow (Default 50) rows -- number of rows in conventional path bind array or between direct path data saves (Default: Conventional path 64, Direct path all) bindsize -- size of conventional path bind array in bytes (Default 256000) silent -- suppress messages during run (header,feedback,errors,discards,partitions) direct -- use direct path (Default FALSE) parfile -- parameter file: name of file that contains parameter specifications parallel -- do parallel load (Default FALSE) file -- file to allocate extents from skip_unusable_indexes -- disallow/allow unusable indexes or index partitions (Default FALSE) skip_index_maintenance -- do not maintain indexes, mark affected indexes as unusable (Default FALSE) commit_discontinued -- commit loaded rows when load is discontinued (Default FALSE) readsize -- size of read buffer (Default 1048576) external_table -- use external table for load; NOT_USED, GENERATE_ONLY, EXECUTE (Default NOT_USED) columnarrayrows -- number of rows for direct path column array (Default 5000) streamsize -- size of direct path stream buffer in bytes (Default 256000) multithreading -- use multithreading in direct path resumable -- enable or disable resumable for current session (Default FALSE) resumable_name -- text string to help identify resumable statement resumable_timeout -- wait time (in seconds) for RESUMABLE (Default 7200) date_cache -- size (in entries) of date conversion cache (Default 1000) no_index_errors -- abort load on any index errors (Default FALSE) PLEASE NOTE: Command-line parameters may be specified either by position or by keywords. An example of the former case is 'sqlldr scott/tiger foo'; an example of the latter is 'sqlldr control=foo userid=scott/tiger'. One may specify parameters by position before but not after parameters specified by keywords. For example, 'sqlldr scott/tiger control=foo logfile=log' is allowed, but 'sqlldr scott/tiger control=foo log' is not, even though the position of the parameter 'log' is correct.
欢迎关注我的微信公众号: coderbee笔记,可以更及时回复你的讨论。