Finally, note that the Mapper's run() method is public and may be customized by
users. MultithreadedMapper is an implementation that runs mappers concurrently
in a configurable number of threads (set by
mapreduce.mapper.multithreadedmapper.threads). For most data processing
tasks, it confers no advantage over the default implementation. However, for mappers that
spend a long time processing each record (because they contact external servers, for
example), it allows multiple mappers to run in one JVM with little contention.
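To make the benefit concrete, here is a minimal plain-Java sketch of the underlying idea. It is not Hadoop's MultithreadedMapper implementation and uses no Hadoop classes. The names ThreadedMapSketch, slowLookup, and mapConcurrently are hypothetical, and the 100 ms sleep is an assumed stand-in for a call to an external server. When each record spends most of its time waiting, a small pool of threads in one JVM can overlap those waits.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

public class ThreadedMapSketch {

    // Hypothetical stand-in for a slow map() call, such as one that
    // waits on a remote server before producing its output.
    public static String slowLookup(String record) {
        try {
            Thread.sleep(100); // simulated network latency (assumed value)
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return record.toUpperCase();
    }

    // Applies mapFn to every record on a fixed pool of worker threads
    // and returns the results in input order.
    public static <I, O> List<O> mapConcurrently(
            List<I> records, Function<I, O> mapFn, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<O>> futures = new ArrayList<>();
            for (I record : records) {
                Callable<O> task = () -> mapFn.apply(record);
                futures.add(pool.submit(task));
            }
            List<O> results = new ArrayList<>();
            for (Future<O> future : futures) {
                results.add(future.get()); // blocks until this record is done
            }
            return results;
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException("map task failed", e);
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        List<String> records = Arrays.asList("a", "b", "c", "d", "e", "f", "g", "h");
        long start = System.nanoTime();
        List<String> out = mapConcurrently(records, ThreadedMapSketch::slowLookup, 4);
        long elapsedMs = (System.nanoTime() - start) / 1_000_000;
        System.out.println(out + " in about " + elapsedMs + " ms");
    }
}
```

With one thread the eight records are processed one after another; with four threads the waits overlap, so the wall-clock time should drop noticeably. For CPU-bound map functions the same change would be expected to help little, which matches the point above that most data processing tasks gain nothing from multithreading.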